[R] Deep Voice 2: Multi-Speaker Neural Text-to-Speech • r/MachineLearning
TL;DR Baidu's TTS system now supports multi-speaker conditioning, and can learn new speakers with very little data (a la LyreBird). I'm really excited about the recent influx of neural-net TTS systems, but all of the them seem to be too slow for real time dialog, or not publicly available, or both. Hoping that one of them gets a high quality open-source implementation soon!
May-25-2017, 20:35:23 GMT
- Technology: