Baidu Deep Voice explained: Part 1 -- the Inference Pipeline – Athelas
This post is the first in what I hope will be a series covering recently published ML/AI papers that I think are particularly important. Some of the ideas in these papers are fairly intuitive, and I hope I'm able to communicate some of that intuition in this format. The first paper I'll cover is Baidu's Deep Voice, an impressive recent paper from Andrew Ng's Baidu AI team that describes a new deep-learning-based system for converting text to speech.
Mar-14-2017, 22:10:04 GMT