
"Microsoft researchers are developing technologies for a new class of video AI agents to explore three-dimensional spaces before making decisions."
"The technology framework, called MindJourney, uses a range of AI technologies to understand and analyze 3D spaces, reason about the surroundings, and predict movement, the researchers wrote in a blog entry late last month. MindJourney includes video-generation systems, vision language models (VLMs), and reasoning techniques that can predict surroundings, patterns, and movement. These technologies are packaged around "world models" that simulate real-world surroundings."
Microsoft researchers are developing MindJourney, a technology framework for a new class of video AI agents that explore three-dimensional spaces before making decisions. MindJourney combines video-generation systems, vision language models (VLMs), and reasoning techniques to understand and analyze 3D environments and to predict surroundings, patterns, and movement. These technologies are packaged around world models that simulate real-world surroundings, so an agent can reason about spatial layouts and likely future states before it acts. Combining the three is meant to support anticipatory behavior and safer, more informed decision-making.
Read at Computerworld