Speech Modelling Using Subspace and EM Techniques

Smith, Gavin, Freitas, João F. G. de, Robinson, Tony, Niranjan, Mahesan

Dec-31-2000–Neural Information Processing Systems

The speech waveform can be modelled as a piecewise-stationary linear stochastic state space system, and its parameters can be estimated using an expectation-maximisation (EM) algorithm. One problem is the initialisation ofthe EM algorithm. Standard initialisation schemes can lead to poor formant trajectories. But these trajectories however are important forvowel intelligibility. The aim of this paper is to investigate the suitability of subspace identification methods to initialise EM. The paper compares the subspace state space system identification (4SID) method with the EM algorithm. The 4SID and EM methods are similar in that they both estimate a state sequence (but using Kalman filters andKalman smoothers respectively), and then estimate parameters (but using least-squares and maximum likelihood respectively).

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Dec-31-2000

Conferences PDF

Add feedback

Country:
- North America > United States (0.29)
- Europe > United Kingdom
  - England (0.15)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.79)
  - Machine Learning > Statistical Learning (0.78)

Duplicate Docs Excel Report

Title
Speech Modelling Using Subspace and EM Techniques
Speech Modelling Using Subspace and EM Techniques

Similar Docs Excel Report more

Title	Similarity	Source
None found