OpenAI says it can clone a voice from just 15 seconds of audio

Mar-29-2024, 19:03:56 GMT–Engadget

OpenAI just announced that it recently conducted a small-scale preview of a new tool called Voice Engine. This is a voice cloning technology that can mimic any speaker by analyzing a 15-second audio sample. The company says it generates "natural-sounding speech" with "emotive and realistic voices." The technology is based on the company's pre-existing text-to-speech API and it has been in the works since 2022. OpenAI has already been using a version of the toolset to power the preset voices available in the current text-to-speech API and the Read Aloud feature. There are a bunch of samples on the company's official blog and they sound eerily close to the real thing.

clone, just 15, openai, (3 more...)

Engadget

Mar-29-2024, 19:03:56 GMT

News Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.73)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (0.95)
    - Chatbot (0.95)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.95)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found