The Intelligent Voice 2016 Speaker Recognition System

Khosravani, Abbas, Glackin, Cornelius, Dugan, Nazim, Chollet, Gérard, Cannings, Nigel

Nov-2-2016–arXiv.org Machine Learning

We trained on each acoustic feature a full covariance, genderindependent UBM model with 2048 Gaussians followed by a 600-dimensional i-vector extractor to establish our MFCCand PLP-based i-vector systems. The unlabeled set of development data was used in the training of both the UBM and the i-vector extractor. The open-source Kaldi software has been used for all these processing steps [20]. It has been shown that successive acoustic observation vectors tend to be highly correlated. This may be problematic for maximum a posteriori (MAP) estimation of i-vectors. To investigating this issue, scaling the zero and first order Baum-Welch statistics before presenting them to the i-vector extractor has been proposed. It turns out that a scale factor of 0.33 gives a slight edge, resulting in a better decision cost function [10]. This scaling factor has been performed in training the i-vector extractor as well as in the testing.

machine learning, pattern recognition, speaker and language recognition workshop, (12 more...)

arXiv.org Machine Learning

Nov-2-2016

arXiv.org PDF

Add feedback

Country:
- Europe > Finland (0.16)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Pattern Recognition > Speech Recognition (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found