Google's Genie model creates interactive 2D worlds from a single image
Briefly

...Genie does something altogether different, converting images into 'interactive, playable environments that can be easily created, stepped into, and explored.'
Instead, the system treats its starting image (or images) as frames of a video and generates a best guess at what the entire next frame (or frames) should look like when given a specific input.
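As a rough illustration of that idea, and not Genie's actual architecture, the loop below sketches action-conditioned next-frame generation: each new frame is predicted from the frames so far plus one input, then appended to the history. The names `toy_next_frame` and `rollout` are hypothetical, and the pixel-scrolling rule is an assumed stand-in for a trained predictor.

```python
from typing import Callable, List

# A tiny grayscale image: rows of pixel values.
Frame = List[List[int]]


def toy_next_frame(frames: List[Frame], action: str) -> Frame:
    """Hypothetical stand-in for a learned model.

    A real system would predict the next frame with a trained network;
    here we only scroll the most recent frame by one pixel.
    """
    last = frames[-1]
    if action == "right":
        # Moving right scrolls the scene left (with wrap-around).
        return [row[1:] + row[:1] for row in last]
    if action == "left":
        # Moving left scrolls the scene right (with wrap-around).
        return [row[-1:] + row[:-1] for row in last]
    # Any other input: repeat the last frame unchanged.
    return [row[:] for row in last]


def rollout(
    start: Frame,
    actions: List[str],
    predict: Callable[[List[Frame], str], Frame] = toy_next_frame,
) -> List[Frame]:
    """Generate frames one at a time, each conditioned on one input."""
    frames = [start]
    for action in actions:
        # The predictor sees the whole history plus the current input.
        frames.append(predict(frames, action))
    return frames


if __name__ == "__main__":
    for frame in rollout([[1, 2, 3]], ["right", "left"]):
        print(frame)
```

The point of the sketch is the control flow: the "world" is never stored as a level or a map, only as a growing sequence of frames, and each player input selects what the next predicted frame looks like.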
Read at Ars Technica