The Intelligent Voice 2016 Speaker Recognition System
Khosravani, Abbas, Glackin, Cornelius, Dugan, Nazim, Chollet, Gérard, Cannings, Nigel
We trained on each acoustic feature a full covariance, genderindependent UBM model with 2048 Gaussians followed by a 600-dimensional i-vector extractor to establish our MFCCand PLP-based i-vector systems. The unlabeled set of development data was used in the training of both the UBM and the i-vector extractor. The open-source Kaldi software has been used for all these processing steps [20]. It has been shown that successive acoustic observation vectors tend to be highly correlated. This may be problematic for maximum a posteriori (MAP) estimation of i-vectors. To investigating this issue, scaling the zero and first order Baum-Welch statistics before presenting them to the i-vector extractor has been proposed. It turns out that a scale factor of 0.33 gives a slight edge, resulting in a better decision cost function [10]. This scaling factor has been performed in training the i-vector extractor as well as in the testing.
Nov-2-2016
- Country:
- Asia > Singapore (0.04)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Spain > Basque Country
- Biscay Province > Bilbao (0.04)
- Finland > North Karelia
- Joensuu (0.05)
- United Kingdom > England
- Genre:
- Research Report (0.64)
- Technology: