New Tulu 3 claims to beat DeepSeek
Tülu 3 405B surpasses DeepSeek through advanced Reinforcement Learning techniques.
Meta Releases Llama 3.3: A Multilingual Model with Enhanced Performance and Efficiency
Meta's Llama 3.3 model enhances AI capabilities with improved efficiency, multilingual support, and safety features, setting new benchmarks in reasoning and coding.
X Releases Back-End Code and Weighting Data for Grok LLM
Grok-1 large language model details revealed. Concerns about AI and forced diversity highlighted.
Gemini 1.5 is Google's next-gen AI model - and it's already almost ready
Gemini 1.5 is an improved version of Google's language model that is faster and more efficient. Gemini 1.5 has a context window of 1 million tokens, allowing it to handle larger queries and process more information at once.
Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models
Llama 3.2 is Meta's first multimodal language model, allowing interaction with visual and voice data while offering customizable features.
Meta Open-Sources Large Concept Model, a Language Model That Predicts Entire Sentences
Meta's LCM model innovatively uses sentence embeddings to enhance performance and reasoning capabilities beyond traditional token-based approaches.
Stanford AI Model Helps Locate Racist Deeds in Santa Clara County | KQED
Using an open-source language model, the RegLab rapidly scanned millions of deed records, significantly reducing time and cost compared to traditional methods.
Introducing LLaVA-Phi: A Compact Vision-Language Assistant Powered By a Small Language Model | HackerNoon
LLaVA-Phi showcases the capabilities of smaller language models in multi-modal tasks with only 2.7B parameters.
NVIDIA Unveils NVLM 1.0: Open-Source Multimodal LLM with Improved Text and Vision Capabilities
NVIDIA's NVLM 1.0 is a versatile open-source multimodal language model that enhances performance in both vision-language and text-only tasks after multimodal training.
Mistral AI and NVIDIA Launch Mistral NeMo 12B
Mistral AI and NVIDIA collaborated on Mistral NeMo 12B, a high-performance, customizable language model ideal for enterprise applications.
Bloomberg
OpenAI has partnered with Microsoft to develop a large-scale language model known as GPT-3. The partnership aims to improve the deployment and accessibility of GPT-3 in various applications.
OpenAI lets developers build real-time voice apps - at a substantial premium
OpenAI launched a real-time API for more interactive, spoken-language communication with its LLMs.
OpenAI makes ChatGPT smarter and quicker for paying users
GPT-4 Turbo by OpenAI focuses on improving writing, math, logical reasoning, and coding skills with a more conversational language approach.
Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space | HackerNoon
A bilingual Arabic-Hebrew language model using transliteration shows promising effectiveness, outperforming Arabic-only script models despite a smaller training dataset.
Grok-2 Beta Version Released on X Platform
Grok-2 and Grok-2 mini outperform previous models, achieving higher scores in various academic benchmarks and introducing advanced features for users.
Send AI to space to spill human secrets to the aliens, scientists urge - but is it worth the risk?
AI could be used to communicate with potential alien life by creating a language learning model for extraterrestrials to engage with.
Spain to create AI model in local languages for important national project
Creation of a large language model in Spanish and co-official languages. Focus on cross-sector collaboration and international outreach.
X Releases Back-End Code and Weighting Data for its 'Grok' LLM
Grok-1 released: 314 billion parameter model architecture and weights available. Risks of woke AI: Concerns about forced diversity in AI leading to extreme outcomes.