Large Margin Hidden Markov Models for Automatic Speech Recognition
–Neural Information Processing Systems
We study the problem of parameter estimation in continuous density hidden Markov models (CD-HMMs) for automatic speech recognition (ASR). As in support vectormachines, we propose a learning algorithm based on the goal of margin maximization. Unlike earlier work on max-margin Markov networks, our approach is specifically geared to the modeling of real-valued observations (such as acoustic feature vectors) using Gaussian mixture models. Unlike previous discriminative frameworksfor ASR, such as maximum mutual information and minimum classification error, our framework leads to a convex optimization, without any spurious local minima. The objective function for large margin training of CD-HMMs is defined over a parameter space of positive semidefinite matrices. Its optimization can be performed efficiently with simple gradient-based methods thatscale well to large problems. We obtain competitive results for phonetic recognition on the TIMIT speech corpus.
Neural Information Processing Systems
Dec-31-2007
- Country:
- North America > United States
- California
- Alameda County > Berkeley (0.14)
- San Diego County (0.14)
- San Francisco County > San Francisco (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California
- North America > United States
- Technology: