NVIDIA's new AI model Fugatto can create audio from text prompts
Briefly

"We wanted to create a model that understands and generates sound like humans do," said Rafael Valle, emphasizing the human-like comprehension and generation abilities of Fugatto.
NVIDIA noted that Fugatto could quickly generate song prototypes for music producers who can easily edit styles, voices, and instruments based on their initial ideas.
The model's capabilities extend beyond its training, allowing it to combine tasks like generating speech with specific emotions and accents, showcasing its adaptability.
Fugatto can create dynamic sounds evolving over time, such as simulating a rainstorm that progresses across a landscape, making it versatile for various applications.
Read at Engadget
[
|
]