Sora's inability to generate accurate gymnastic videos highlights the difficulty in teaching AI models to grasp complex physical movements and representations.
Menlo Ventures principal investor Deedy Das explains that Sora relies on a transformer model architecture similar to language models, limiting its understanding of physics.
Collection
[
|
...
]