
"Google described Omni as "where Gemini's ability to reason meets the ability to create." Interestingly, according to the company, "With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge.""
"Although Omni is "starting with video," Google said the new model can "create anything from any input," so presumably we'll see other media types generated by the tool within due time."
"Omni will also be available in model tiers, starting now with Gemini Omni Flash. The capability is coming to the Gemini app, Google Flow, and YouTube Shorts. It's not clear whether the web version of Gemini will support Omni, or whether you'll need to use the Flow interface via your browser."
"Today, Google announced a new AI video capability that will either help creatives produce higher-quality videos more easily, or vastly increase the amount of AI slop on YouTube. I'm betting it'll be a mix of both."
Gemini Omni is a new Google AI video capability designed to improve how videos are created from multiple inputs. The system combines images, audio, video, and text as input and generates high-quality videos grounded in Gemini’s real-world knowledge. The capability is positioned as a leap similar to the improvement seen with Nano Banana for image generation. Omni starts with video but is described as able to create anything from any input, suggesting expansion to other media types. It rolls out immediately in model tiers beginning with Gemini Omni Flash and becomes available in the Gemini app, Google Flow, and YouTube Shorts. Trust concerns may arise from AI avatars and increased low-quality content.
Read at ZDNET
Unable to calculate read time
Collection
[
|
...
]