[R] Deep Voice 2: Multi-Speaker Neural Text-to-Speech • r/MachineLearning

May-25-2017, 20:35:23 GMT–#artificialintelligence

TL;DR Baidu's TTS system now supports multi-speaker conditioning, and can learn new speakers with very little data (a la LyreBird). I'm really excited about the recent influx of neural-net TTS systems, but all of the them seem to be too slow for real time dialog, or not publicly available, or both. Hoping that one of them gets a high quality open-source implementation soon!

machinelearning, social media, speech synthesis, (3 more...)

#artificialintelligence

May-25-2017, 20:35:23 GMT

News Web Page

Add feedback

Industry:
- Media > News (0.40)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Vision > Optical Character Recognition (0.40)
    - Speech > Speech Synthesis (0.40)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found