Baidu's 'Deep Voice' AI System can Clone your Voice

#artificialintelligence 

Chinese internet search giant Baidu has developed an AI system that can clone an individual's voice! An year in the making, the text to speech system, called Deep Voice, can generate synthetic human voices using deep neural networks. According to the information shared by Baidu Research, they claim that it takes their trained model just three seconds to replicate and output a person's voice. Baidu's research team used voice cloning techniques to develop the AI system which they expect will have noteworthy applications in personalizing human-machine interface. Both Speaker Adaptation and Speaker Encoding (requiring minimal audio) provide quality performance and can be integrated in the Deep Voice model along with speaker embeddings without having to compromise the quality of the source audio. You can check out some audio samples provided by Baidu's Research team which consist of original and synthesized voices.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found