ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition
Frey, Brendan J., Kristjansson, Trausti T., Deng, Li, Acero, Alex
–Neural Information Processing Systems
A challenging, unsolved problem in the speech recognition community isrecognizing speech signals that are corrupted by loud, highly nonstationary noise. One approach to noisy speech recognition isto automatically remove the noise from the cepstrum sequence beforefeeding it in to a clean speech recognizer. In previous work published in Eurospeech, we showed how a probability model trained on clean speech and a separate probability model trained on noise could be combined for the purpose of estimating the noisefree speechfrom the noisy speech. We showed how an iterative 2nd order vector Taylor series approximation could be used for probabilistic inferencein this model. In many circumstances, it is not possible to obtain examples of noise without speech.
Neural Information Processing Systems
Dec-31-2002