Google calls Gemini a 'natively multimodal' model, meaning it can learn from data beyond just text, also slurping up insights from audio, video, and images.
[
Collection
]
[
|
...
]
Google calls Gemini a 'natively multimodal' model, meaning it can learn from data beyond just text, also slurping up insights from audio, video, and images.