Wednesday , 30 April 2025
Home Kripto New NVIDIA AI Model Fugatto Creates Audio from Text Prompts
Kripto

New NVIDIA AI Model Fugatto Creates Audio from Text Prompts

New NVIDIA AI Model Fugatto Creates Audio from Text Prompts

NVIDIA has unveiled an experimental AI model, Fugatto, capable of generating audio from text prompts and modifying existing sound files. Officially named the Foundational Generative Audio Transformer Opus 1, the model is designed to provide a versatile solution for sound creation, described by NVIDIA as “a Swiss Army knife for sound.” Built by an international team of AI researchers, Fugatto’s capabilities extend across multiple languages and accents, enhanced by the diversity of its developers.

According to Rafael Valle, NVIDIA’s manager of applied audio research, the goal was to develop a model that approaches sound generation with human-like understanding. Fugatto enables applications ranging from rapid music prototyping to creating personalized language learning tools and dynamic audio assets for video games. For example, music producers can use the model to experiment with different voices, instruments, and styles, while game developers might customize in-game soundscapes to reflect player decisions.

NVIDIA’s Fugatto represents an exciting leap forward in generative AI, with its ability to craft complex, dynamic audio. While its practical applications remain to be tested at scale, the technology holds immense promise for creative industries, blending technical sophistication with artistic possibilities.

Beyond these use cases, the researchers discovered Fugatto could handle tasks outside its training scope. With minimal fine-tuning, the model can combine separate training instructions, such as generating emotionally expressive speech in specific accents or blending natural sounds like birdsong with the dynamic intensity of a thunderstorm. It can also produce audio that evolves over time, such as rainstorms traversing landscapes.

Despite its advanced capabilities, NVIDIA has yet to announce plans for public access to Fugatto. This development follows similar initiatives from tech giants like Meta, which introduced an open-source AI for sound creation, and Google, whose MusicLM tool generates music from text prompts via its AI Test Kitchen.

Related Articles

Elon Musk’s X Corp. Challenges Minnesota’s Political Deepfake Ban in Court
Kripto

Elon Musk’s X Corp. Challenges Minnesota’s Political Deepfake Ban in Court

X Corp., the social media platform owned by Elon Musk, is suing....

TikTok’s Rise Ignites a Global Race in Short-Form Video Content
Kripto

TikTok’s Rise Ignites a Global Race in Short-Form Video Content

Since its worldwide launch in 2016, TikTok has disrupted the social media...

India Tightens Grip on Cryptocurrency as Major Exchanges Face Challenges
Kripto

India Tightens Grip on Cryptocurrency as Major Exchanges Face Challenges

India’s cryptocurrency regulatory climate is still in flux and remains a huge...

Instagram Edits Outpaces CapCut with Over 7 Million Downloads in First Week
Kripto

Instagram Edits Outpaces CapCut with Over 7 Million Downloads in First Week

Instagram Edits, Meta’s newly released video creation app, had a bigger debut...