Meta's Voicebox AI is a Dall-E for text-to-speech

Jun-16-2023, 15:00:21 GMT–Engadget

Today, we are one step closer to the immortal celebrity future we have long been promised (since April). Meta has unveiled Voicebox, its generative text-to-speech model that promises to do for the spoken word what ChatGPT and Dall-E, respectfully, did for text and image generation. Essentially, its a text-to-output generator just like GPT or Dall-E -- just instead of creating prose or pretty pictures, it spits out audio clips. Meta defines the system as "a non-autoregressive flow-matching model trained to infill speech, given audio context and text." It's been trained on more than 50,000 hours of unfiltered audio.

dall-e, meta, speech, (6 more...)

Engadget

Jun-16-2023, 15:00:21 GMT

News Web Page

Add feedback

Industry:
- Media (0.32)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.83)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found