#multimodal-capabilities

[ follow ]
#gpt-4o
ITPro
2 weeks ago
Data science

Everything you need to know about GPT-4o, including pricing, features, and how to get access

OpenAI introduces GPT-4o, a multimodal language model with text, voice, and video capabilities, enhancing human-computer interaction. [ more ]
The Verge
2 weeks ago
Data science

OpenAI releases GPT-4o, a faster model that's free for all ChatGPT users

GPT-4o is faster and boosts capabilities in text, vision, and audio.
Free for all users, paid users have up to five times capacity limits. [ more ]
moregpt-4o
InfoQ
3 months ago
Artificial intelligence

Google announces multi-modal Gemini 1.5 with million token context length

Gemini 1.5 enhances AI efficiency with MoE architecture and expanded context window.
Gemini 1.5 offers multimodal capabilities for diverse interactions like processing long documents and movies. [ more ]
Engadget
5 months ago
Artificial intelligence

The Ray-Ban Meta smart glasses are getting AI-powered visual search features

Ray-Ban Meta smart glasses will receive upgrades to its AI assistant, including real-time information support and multimodal capabilities.
Meta AI will now be able to answer questions based on the user's environment and provide contextual information. [ more ]
[ Load more ]