Neural Voice Cloning: Teaching Machines to Generate Speech

Mar-6-2018, 14:16:20 GMT–#artificialintelligence

At Baidu Research, we aim to revolutionize human-machine interfaces with the latest artificial intelligence techniques. Our Deep Voice project, started a year ago, focuses on teaching machines to generate more human-like speech from text. Beyond single-speaker speech synthesis, we demonstrated that a single system could learn to reproduce thousands of speaker identities with less than half an hour of training data per speaker. This capability was enabled by learning shared and discriminative information from speakers. Motivated to push this idea further, we attempted to learn speaker characteristics from only a few utterances (i.e., sentences a few seconds long).

artificial intelligence, machine learning, neural voice cloning

Industry:
- Information Technology > Security & Privacy (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.81)