Application of Variational Bayesian Approach to Speech Recognition

Watanabe, Shinji, Minami, Yasuhiro, Nakamura, Atsushi, Ueda, Naonori

Dec-31-2003–Neural Information Processing Systems

Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura and Naonori Ueda
NTT Communication Science Laboratories, NTT Corporation
2-4, Hikaridai, Seika-cho, Soraku-gun, Kyoto, Japan
{watanabe,minami,ats,ueda}@cslab.kecl.ntt.co.jp

Abstract

In this paper, we propose a Bayesian framework, variational Bayesian estimation and clustering for speech recognition (VBEC), which constructs shared-state triphone HMMs based on a variational Bayesian approach and recognizes speech based on Bayesian prediction classification. An appropriate model structure with high recognition performance can be found within the VBEC framework. Unlike conventional methods, including the BIC or MDL criterion based on the maximum likelihood approach, the proposed model selection is valid in principle even when the amount of data is insufficient, because it does not rely on an asymptotic assumption. In isolated word recognition experiments, we show the advantage of VBEC over conventional methods, especially when dealing with small amounts of data.

1 Introduction

Statistical modeling of the spectral features of speech (acoustic modeling) is one of the most crucial parts of speech recognition. In acoustic modeling, the triphone-based hidden Markov model (triphone HMM) has been widely employed.
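The conventional BIC-style selection that the abstract contrasts with VBEC can be illustrated with a minimal sketch. It is not the VBEC method and is not taken from the paper: it assumes 1-D features, one Gaussian per state cluster, and a single yes/no split of the kind used when clustering triphone states. The function names `gaussian_loglik` and `bic_split_gain`, the variance floor, and the toy data are all hypothetical. The penalty term `0.5 * k * log(N)` comes from a large-sample approximation, which is the asymptotic assumption the abstract refers to.

```python
import math


def gaussian_loglik(xs, var_floor=1e-6):
    """Log-likelihood of 1-D data under one Gaussian fitted by maximum likelihood.

    var_floor is an assumed safeguard against zero variance, not from the paper.
    """
    n = len(xs)
    mean = sum(xs) / n
    sq_dev = sum((x - mean) ** 2 for x in xs)
    var = max(sq_dev / n, var_floor)
    return -0.5 * n * math.log(2 * math.pi * var) - 0.5 * sq_dev / var


def bic_split_gain(yes, no, params_per_gaussian=2):
    """BIC gain from splitting one cluster into two child clusters.

    A positive value means the split is accepted under BIC. The split adds
    one extra Gaussian, i.e. params_per_gaussian extra free parameters
    (mean and variance in the 1-D case).
    """
    parent = yes + no
    delta_loglik = (
        gaussian_loglik(yes) + gaussian_loglik(no) - gaussian_loglik(parent)
    )
    penalty = 0.5 * params_per_gaussian * math.log(len(parent))
    return delta_loglik - penalty


if __name__ == "__main__":
    # Toy data (hypothetical): two well-separated groups versus two identical ones.
    separated = bic_split_gain([0.0, 0.1, -0.1, 0.05, -0.05],
                               [10.0, 10.1, 9.9, 10.05, 9.95])
    identical = bic_split_gain([-0.1, 0.0, 0.1], [-0.1, 0.0, 0.1])
    print("split accepted for separated groups:", separated > 0)
    print("split accepted for identical groups:", identical > 0)
```

With only a handful of samples per cluster, as in the toy data, the `log(N)` penalty is applied far outside the large-sample regime it was derived for, which is the situation the abstract says VBEC is meant to handle.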

artificial intelligence, machine learning, training data

Country:
- Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.24)

Genre:
- Research Report (0.46)

Industry:
- Information Technology (0.74)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (1.00)
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (1.00)
