#multimodal-ai

smart-glasses
livescience.com
6 days ago
Artificial intelligence

Meta just stuck its AI somewhere you didn't expect it - a pair of Ray-Ban smart glasses

AI integration could revolutionize smart glasses technology.
WIRED
3 days ago
Artificial intelligence

Astra Is Google's Answer to the New ChatGPT

Google and OpenAI demonstrate impressive advancements in multimodal AI models, according to MIT assistant professor Pulkit Agrawal.
Gadgets 360
3 weeks ago
Artificial intelligence

Ray-Ban Meta Smart Glasses Can Now Tell You What You're Looking At

Ray-Ban Meta smart glasses now offer multimodal AI capabilities, including object identification and voice command controls.
The Verge
3 weeks ago
Artificial intelligence

The Ray-Ban Meta Smart Glasses have multimodal AI now

Smart glasses are evolving with features like multimodal AI, enhancing user experiences.
Gemini
english.elpais.com
5 months ago
Artificial intelligence

Google launches Gemini, an AI model capable of outperforming humans in multitasking language comprehension

Google has launched Gemini, a multimodal AI platform that can process and generate text, code, images, audio, and video from different data sources.
Gemini outperforms human experts on the Massive Multitask Language Understanding (MMLU) benchmark, scoring over 90%.
Engadget
5 months ago
Artificial intelligence

The Morning After: Google's Gemini is the company's answer to ChatGPT

Google introduces Gemini, its most advanced language model to date.
Gemini is a multimodal AI that can understand and reason across various kinds of input.
WIRED
5 months ago
Artificial intelligence

Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI

Google has announced the AI model Gemini, which can process information in the form of text, audio, images, and video.
Gemini is described as a 'multimodal' model that can perform complex reasoning and combine information from different modalities.
Artificial intelligence
New Atlas
1 week ago
Artificial intelligence

Google's medical AI destroys GPT's benchmark and outperforms doctors

AI models like Google's Med-Gemini are advancing to process diverse medical information, approaching real-world doctor capabilities.
www.cnn.com
4 days ago
Artificial intelligence

OpenAI unveils newest AI model, GPT-4o

GPT-4o will enhance ChatGPT with memory capabilities, real-time translation, and text-vision interaction, simplifying accessibility for all users.
The Verge
3 days ago
Artificial intelligence

Project Astra is the future of AI at Google

Next-gen bots like Google's Project Astra aim to be truly useful assistants.
www.housingwire.com
1 day ago
Artificial intelligence

RealReports enhances property document analysis with new multimodal AI feature

RealReports unveiled a new feature for its AI assistant Aiden, using multimodal AI to summarize property documents quickly and efficiently.
Engadget
3 weeks ago
Artificial intelligence

Ray-Ban Meta smart glasses do the AI thing without a projector or subscription

The Ray-Ban Meta smart glasses now feature multimodal AI, enhancing their functionality and interaction with users.
Entrepreneur
4 weeks ago
Artificial intelligence

Meta AI Unveils First Two Versions of Llama 3

Meta released Llama 3 models, enhancing Meta AI's capabilities to be more intelligent and diverse.
The Verge
6 days ago
Data science

OpenAI could debut a multimodal AI digital assistant soon

OpenAI is developing a new multimodal AI model with improved image and audio interpretation capabilities.
InfoQ
1 month ago
Data science

Google Trains User Interface and Infographics Understanding AI Model ScreenAI

Google Research developed ScreenAI, a multimodal AI model based on PaLI that understands infographics and user interfaces and achieves state-of-the-art performance.
Engadget
1 month ago
Data science

The latest version of xAI's Grok can process images

xAI introduces Grok-1.5V, a multimodal AI model for processing visual information.
TechRepublic
1 month ago
Artificial intelligence

Top 5 AI Trends to Watch in 2024

AI requires massive compute power for unstructured data.
AI is impacting organizational structure, careers, and the artistic world.
The Verge
1 month ago
Artificial intelligence

Meta is adding AI to its Ray-Ban smart glasses next month

Meta is bringing AI features to its Ray-Ban smart glasses next month.
The glasses can perform translation and identification tasks, but not always accurately.
Medium
2 months ago
UX design

VisionPro and beyond: protecting users in the era of spatial computing

Spatial computing advancements are rapidly evolving, with AI and mixed reality technologies leading the way.
User Experience Design is driven by psychology to create intuitive products.
Gadgets 360
3 months ago
Artificial intelligence

Frame AI Glasses With Multimodal AI features, Unveiled by Brilliant Labs

Brilliant Labs has unveiled the Frame AI Glasses, an AI-powered wearable gadget that competes with similar products on the market.
The glasses have a micro-OLED display and multimodal AI capabilities, offering a wide range of functionalities.
Engadget
3 months ago
Artificial intelligence

The Ray-Ban Meta smart glasses' new AI powers are impressive, and worrying

Multimodal AI allows Ray-Ban Meta smart glasses to respond to queries based on what the wearer is looking at.
Real-time information from the Meta AI assistant is inaccurate and unreliable.
The New Stack
4 months ago
Web design

Web Dev 2024: Fediverse Ramps Up, More AI, Less JavaScript

Fediverse development is increasing.
Use of AI development tools and multimodal AI is growing.
The Conversation
5 months ago
Artificial intelligence

Google's Gemini: is the new AI model really better than ChatGPT?

Google DeepMind has announced Gemini, a new AI model designed to compete with OpenAI's ChatGPT.
Gemini is a multimodal model that can work with text, images, audio, and video as input and output.
National Institute of Mental Health (NIMH)
3 months ago
Artificial intelligence

Multimodal Artificial Intelligence: Opportunities and Challenges in HIV Clinical Care

The goal of this concept is to encourage the use of multimodal artificial intelligence to accelerate HIV diagnosis, prevention, and treatment.
The concept aims to leverage advanced multimodal AI models to improve HIV prevention, treatment, and care by expanding capacities in clinical care and data-driven applications.