#Multimodal AI

TechCrunch
3 days ago
Artificial intelligence

Meta AI can now understand and edit your photos | TechCrunch

Meta AI is enhancing photo editing and interaction capabilities, competing closely with Google and OpenAI.
Multimodal capabilities allow for photo sharing and inquiry-based interactions, enhancing user experience.
AI can edit images contextually, creating a dynamic way for users to interact with their photos.
WIRED
3 days ago
Artificial intelligence

The Most Capable Open Source AI Model Yet Could Supercharge AI Agents

Molmo, an open source multimodal AI model, makes it easier for developers to build advanced AI agents that can perform useful tasks on computers.
TechCrunch
1 week ago
Artificial intelligence

Mistral launches a free tier for developers to test its AI models | TechCrunch

Mistral AI launched a free tier for developers to experiment with its AI models, aiming to attract more users and reduce costs.
eWEEK
3 weeks ago
Artificial intelligence

The Future of Generative AI (2024): 8 Predictions to Watch

Generative AI is rapidly becoming integral across industries, evolving with new applications, while posing challenges around job displacement and the need for workforce adaptation.
IT Brief Australia
2 weeks ago
Artificial intelligence

Gartner: 40% of generative AI solutions to be multimodal by 2027

Gartner predicts that 40% of generative AI solutions will be multimodal by 2027, up from just 1% in 2023.
Computerworld
3 weeks ago
Artificial intelligence

The AI glasses market comes into focus

The AI glasses market is diversifying, with varying features and price points that emphasize either innovation or affordability.
The New Stack
8 months ago
Web design

Web Dev 2024: Fediverse Ramps Up, More AI, Less JavaScript

Increase in fediverse development
More AI development tool usage and multimodal AI
#Gemini
english.elpais.com
9 months ago
Artificial intelligence

Google launches Gemini, an AI model capable of outperforming humans in multitasking language comprehension

Google has launched Gemini, a multimodal AI platform that can process and generate text, code, images, audio, and video from different data sources.
Gemini outperforms human experts on the Massive Multitask Language Understanding (MMLU) benchmark, scoring over 90%.
Engadget
9 months ago
Artificial intelligence

The Morning After: Google's Gemini is the company's answer to ChatGPT

Google introduces Gemini, its most advanced language model to date
Gemini is a multimodal AI that can understand and reason on various inputs
WIRED
9 months ago
Artificial intelligence

Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI

Google has announced the AI model Gemini, which can process information in the form of text, audio, images, and video.
Gemini is described as a 'multimodal' model that can perform complex reasoning and combine information from different modalities.