Microsoft's VASA-1 AI model turns still images into lifelike talking faces with visual affective skills, using a single image and audio clip.
The quality of lip-sync, head movements, and facial features in Microsoft's VASA-1 AI model makes it stand out, showcasing hyper-realistic capabilities.
Microsoft acknowledges the potential misuse of VASA-1 for impersonation and misinformation, hence delaying the release until ensuring responsible usage and regulatory compliance.
[
add
]
[
|
|
...
]