Continuous Speech Recognition by Linked Predictive Neural Networks

Tebelskis, Joe, Waibel, Alex, Petek, Bojan, Schmidbauer, Otto

Neural Information Processing Systems 

We present a large vocabulary, continuous speech recognition system based on Linked Predictive Neural Networks (LPNN's). The system uses neural networks as predictors of speech frames, yielding distortion measures which are used by the One Stage DTW algorithm to perform continuous speech recognition. The system, already deployed in a Speech to Speech Translation system, currently achieves 95%, 58%, and 39% word accuracy on tasks with perplexity 5, 111, and 402 respectively, outperforming several simple HMMs that we tested. We also found that the accuracy and speed of the LPNN can be slightly improved by the judicious use of hidden control inputs. We conclude by discussing the strengths and weaknesses of the predictive approach.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found