Deep Speech 3: Even more end-to-end speech recognition - Baidu Research
Accurate speech recognition systems are vital to many businesses, whether they are a virtual assistant taking commands, video reviews that understand user feedback, or improve customer service. However, today's world-class speech recognition systems can only function with user data from third party providers or by recruiting graduates from the world's top speech and language technology programs. At Baidu Research, we have been working on developing a speech recognition system that can be built, debugged, and improved by a team with little to no experience in speech recognition technology (but with a solid understanding of machine learning). We believe a highly simplified speech recognition pipeline should democratize speech recognition research, just like convolutional neural networks revolutionized computer vision. Along this endeavor we developed Deep Speech 1 as a proof-of-concept to show a simple model can be highly competitive with state-of-art models.
Feb-16-2018, 17:40:21 GMT
- Technology: