DeepSeek's V3 AI model gets a major upgrade - here's what's new
  DeepSeek's new V3-0324 model shows significant improvements in reasoning and web development but is recommended for simpler tasks. The AI startup aims to tackle benchmark saturation with advanced assessments.
Too Many AIs With Too Many Terrible Names: How to Choose Your AI Model | HackerNoon
  The AI model landscape is cluttered and confusing, leading to limited user engagement despite numerous advancements.
Apple says it'll use Apple Maps Look Around photos to train AI
  Apple will utilize Look Around imagery to train models for various products, starting March 2025. Privacy remains a priority, as all images will be blurred to protect identities.
We need to stop caring about AI model releases
  Focus on the practical benefits of AI models rather than just technical specifications. The excitement around model launches should center on their usefulness for users, not their technical details.
Have Some Spare Cash? You'll Need it for OpenAI's New API
  OpenAI's o1-pro is a powerful, premium reasoning model for generative AI, available at high prices to cater to advanced users.
Mistral AI unveils small, powerful and open-source AI model
  Mistral AI's lightweight model outperforms larger competitors while increasing AI accessibility.
OpenAI launches new o3-mini reasoning model with a free ChatGPT version
  OpenAI's o3-mini outperforms o1, offering faster and more accurate responses for users.
The Quarterly Review: Current's CTO Trevor Marshall reports on Model-Driven Success - Tearsheet
  Current is enhancing model-driven deployments to improve consumer product offerings and performance. The firm has advanced its AI systems to improve transaction risk detection and combat fraud.
A new, challenging AGI test stumps most AI models | TechCrunch
  The new ARC-AGI-2 test challenges AI models with puzzle-like problems to measure their general intelligence more effectively than previous tests.
Mark Zuckerberg says that Meta's Llama models have hit 1B downloads | TechCrunch
  Meta's Llama AI model family has reached 1 billion downloads in a significant growth period.
Meta has revenue sharing agreements with Llama AI model hosts, filing reveals | TechCrunch
  Meta CEO Mark Zuckerberg revealed that the company profits through revenue-sharing agreements for its Llama AI models, contrary to prior claims.
Microsoft trains own AI models as alternative to OpenAI
  Microsoft is seeking to lessen its reliance on OpenAI by developing its own AI models and exploring alternatives like xAI, Meta, and DeepSeek.
OpenAI unveils its new GPT-4.5 large language model
  OpenAI's GPT-4.5 offers enhanced naturalness, emotional intelligence, and understanding, representing a significant upgrade over GPT-4.
OpenAI finally unveils GPT-4.5. Here's what it can do
  OpenAI's GPT-4.5 offers enhanced contextual understanding and emotional intelligence, marking a significant improvement over its predecessors.
"It's a lemon" - OpenAI's largest AI model ever arrives to mixed reviews
  GPT-4.5 may be OpenAI's last traditional model, with a shift to simulated reasoning due to high costs and diminishing returns.
AI firms follow DeepSeek's lead, create cheaper models with "distillation"
  Distillation enables affordable access to AI models, benefiting developers and businesses using smaller devices like laptops and smartphones.
GPT-4.5 release for ChatGPT appears imminent
  OpenAI is set to preview GPT-4.5, which will initially be available only to Pro subscribers.
Synchron's Brain-Computer Interface Now Has Nvidia's AI
  Synchron's tech allows non-invasive brain-computer interface development, enabling data collection for AI models using avatars as time stamps.
Meta Stock Slides To Approach 3-Month Low Despite This AI Milestone
  Meta stock declines despite AI milestone of 1 billion downloads for Llama models. Concerns over economic conditions impact investor sentiment on tech stocks, including Meta.
New AI models are running on just one or two chips. Will they trigger another DeepSeek moment for Nvidia?
  New AI models require fewer Nvidia chips for increased performance.
The most innovative companies in artificial intelligence for 2025
  The AI industry is transitioning towards real-time reasoning models as a crucial step towards achieving artificial general intelligence.
Microsoft makes DeepSeek's R1 model available on Azure AI and GitHub
  Microsoft integrates DeepSeek's R1 AI model into Azure AI Foundry, enhancing accessibility and cost-effectiveness for developers.
Nvidia says new GPUs are the fastest for DeepSeek AI, which kind of misses the point
  DeepSeek's success with lower-powered AI models signals potential challenges for Nvidia's dominance in the AI hardware market.
DeepSeek is driving demand for Nvidia's H200 chips, some cloud firms say
  DeepSeek's R1 model spurs increased demand for Nvidia's H200 chips, defying Nvidia's stock sell-off.
China puts American AI industry on notice yet again with 'Ernie X1,' Baidu's new open-source reasoning model
  Baidu's new AI models aim to rival OpenAI and DeepSeek in performance while being significantly cheaper.
Baidu launches two new versions of its AI model Ernie | TechCrunch
  Baidu launches new AI models Ernie 4.5 and Ernie X1 with advanced capabilities and cost-effective positioning.
'Open' model licenses often carry concerning restrictions | TechCrunch
  Gemma 3's licensing poses significant legal risks for commercial use, raising concerns among developers about long-term viability.
Google's Gemma 3 is an open source, single-GPU AI with a 128K context window
  Gemma 3 is highlighted as a top single-accelerator model by Google, excelling in chat capabilities and efficiency.
The future of AI isn't the model - it's the system
  The model alone is insufficient; practical value comes from its surrounding systems and capabilities.
Orb AI: The Search Engine of The Future Has Arrived - Presale Raises $1 Million In 3 Weeks
  Orb AI connects top AI models to revolutionize digital content generation.
Claude 3.7 Sonnet offers as much reasoning as you want
  Anthropic's Claude 3.7 Sonnet balances cautious development with advanced reasoning capabilities amid competition from other LLMs.
What is sparsity? DeepSeek AI's secret, revealed by Apple researchers
  DeepSeek has revolutionized AI by utilizing sparsity, allowing for more cost-effective and efficient large language model development.
European AI alliance unveils its own LLM alternative
  Europe is building OpenEuroLLM to create advanced multilingual AI models to compete with American and Chinese technology. The initiative promotes collaboration among leading European AI institutions for impactful public services.
IBM's Arvind Krishna Is Betting on Specialized AI
  IBM is shifting its focus to smaller, more reliable AI tools, moving away from large, costly models.
Habsburg AI Portrait Studies
  Habsburg AI Portrait Studies examines how generative portraiture amplifies aesthetic characteristics of AI models, revealing structural implications of synthetic data usage.
US considers banning DeepSeek on government devices
  DeepSeek's AI models raise user privacy concerns as they gain popularity, particularly with the R1 model outpacing OpenAI's algorithm.
Nasdaq Tumbles: NVIDIA, Broadcom and Microsoft Pull the Market Down
  The Nasdaq decline is largely driven by investor fears surrounding the new DeepSeek-R1 AI model, affecting tech stocks significantly.
DeepSeek means a rethink on AI investment
  DeepSeek's AI model challenges established notions of expensive GPU infrastructure, suggesting innovative methodologies can yield competitive performance.
Microsoft brings distilled DeepSeek R1 models to Copilot+ PCs
  DeepSeek has successfully expanded from mobile to Windows with Microsoft's backing, introducing optimized AI models for cloud and local development.
DeepSeek: Everything you need to know about the AI chatbot app | TechCrunch
  DeepSeek's rapid rise raises questions about the U.S. position in the global AI landscape.
Why everyone is freaking out about DeepSeek
  DeepSeek's affordable AI models challenge traditional cost perceptions in the industry.
ChatGPT doubled its weekly active users in under 6 months, thanks to new releases | TechCrunch
  ChatGPT experienced accelerated growth, reaching 400 million weekly active users partly due to new models and features.
Google DeepMind, Cohere, and Twelve Labs join Sessions: AI | TechCrunch
  AI model development is rapidly evolving, presenting unique opportunities for startup founders to build and innovate in the field.
Amazon is working on a new 'reasoning' AI model that competes with OpenAI and Anthropic
  Amazon is launching a new AI model with advanced hybrid reasoning capabilities by June. The Nova model focuses on cost efficiency and aims to rank among the top 5 in performance benchmarks.
DeepSeek's breakthrough: A shift in AI economics?
  DeepSeek's R1 model challenges assumptions about AI training costs and competition, indicating a potential shift in industry dynamics.
Google Expands Gemini 2.0 AI Models With New Releases and Enhanced Capabilities
  Google launched Gemini 2.0 with enhanced AI models for developers and users globally, focusing on performance and cost efficiency.
Open AI, Anthropic invite US scientists to experiment with frontier models
  AI partnerships with the US government grow, enhancing research while addressing AI safety. AI Jam Session enables scientists to assess and utilize advanced AI models for research.
DeepSeek R1 has taken the world by storm, but security experts claim it has 'critical safety flaws' that you need to know about
  DeepSeek R1's frontier reasoning model has critical safety flaws, achieving a 100% failure rate in blocking harmful prompts.
OpenEvidence v. Pathway: The Legal Battle Over AI Reverse Engineering
  Generative AI models can be reverse engineered, raising legal concerns about trade secrets and competitive practices.
Claude: Everything you need to know about Anthropic's AI | TechCrunch
  Claude 3.7 Sonnet is the most advanced AI model from Anthropic, enabling a unique hybrid reasoning capability.
Anthropic raises $3.5B to fuel its AI ambitions
  Anthropic raised $3.5 billion to enhance AI systems and user experience while aiming for substantial growth despite high development costs.
Anthropic's new Claude model can think both fast and slow
  Anthropic's Claude 3.7 Sonnet is a hybrid reasoning model that allows users to choose between quick or reflective responses.
Anthropic's latest flagship AI might not have been incredibly costly to train | TechCrunch
  Anthropic's Claude 3.7 Sonnet signifies a trend towards more affordable AI model training. While current models are cheaper to train, future AI developments are predicted to escalate in cost significantly.
The business challenge for AI-native applications
  AI integration is crucial for building industry-defining products. The current AI competitive landscape includes AI-native applications and model providers.
Anthropic Launches the World's First 'Hybrid Reasoning' AI Model
  LLMs like Claude can mimic reasoning but often struggle with complex tasks requiring step-by-step thought. Improved models now better handle coding problems and complex applications.
What Is a Diffusion LLM and Why Does It Matter? | HackerNoon
  Inception Labs launched Mercury Coder, the first commercial diffusion LLM, promising faster processing speeds and innovative capabilities.
What is retrieval-augmented generation? More accurate and reliable LLMs
  RAG enhances the accuracy of large language models by integrating external data sources, but it isn't a comprehensive solution.
Microsoft's Phi-4-multimodal AI model handles speech, text, and video
  Microsoft's new small language model aids developers in creating multimodal AI applications for lightweight devices.
Microsoft adds DeepSeek R1 to Azure AI Foundry and GitHub
  Microsoft swiftly added DeepSeek R1 to Azure AI Foundry, showcasing its agility in incorporating new AI models amid competition.
Quora's Poe now lets users create and share custom AI-powered apps | TechCrunch
  Poe Apps enables users to easily create custom applications using various AI models, enhancing creativity and interactivity.
Claude 3.7 Sonnet debuts with "extended thinking" to tackle complex problems
  xAI's Grok 3 and Claude 3.7 Sonnet are enhancing AI's decision-making and creativity capabilities. Claude Code represents a significant advancement in AI-assisted coding, improving developer efficiency.
Fiverr wants freelancers to create AI models
  Fiverr allows creators to train AI on their own work while maintaining control over their creative rights.
Gemini 2.0 Family Expands with Cost-Efficient Flash-Lite and Pro-Experimental Models
  Google introduced Gemini 2.0 Flash-Lite as a cost-effective model optimizing text output while sacrificing some capabilities compared to its predecessors.
Perplexity lets you try DeepSeek R1 - without the security risk
  DeepSeek AI's language models raise concerns about data privacy and Chinese censorship, but Perplexity offers a workaround by hosting in Western data centers.
DeepSeek R1 Now Available on Azure AI Foundry and GitHub, Expanding AI Accessibility for Developers
  Introduction of DeepSeek R1 enhances Microsoft's Azure AI Foundry portfolio, emphasizing advanced AI capabilities for enterprises.
Benchmarking ChatGPT, Qwen, and DeepSeek on Real-World AI Tasks | HackerNoon
  Qwen 2.5 outperforms other AI models like ChatGPT and DeepSeek in coding, mechanics, and algorithmic precision. AI models now deliver significant performance improvements at lower investment costs.
Your Pizza Guy Is Now AI
  Wyze's Palona chatbot personalizes sales interactions using advanced language models and emotional intelligence, ensuring relevant customer engagement.
Endor Labs Adds Ability to Identify Open Source AI Models to SCA Tool - DevOps.com
  Endor Labs expands its SCA tools to include risk detection for open-source AI models downloaded from Hugging Face.