ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition

Apr-6-2023, 16:46:54 GMT–Neural Information Processing Systems

A challenging, unsolved problem in the speech recognition com(cid:173) munity is recognizing speech signals that are corrupted by loud, highly nonstationary noise. One approach to noisy speech recog(cid:173) nition is to automatically remove the noise from the cepstrum se(cid:173) quence before feeding it in to a clean speech recognizer. In previous work published in Eurospeech, we showed how a probability model trained on clean speech and a separate probability model trained on noise could be combined for the purpose of estimating the noise(cid:173) free speech from the noisy speech. We showed how an iterative 2nd order vector Taylor series approximation could be used for prob(cid:173) abilistic inference in this model. In many circumstances, it is not possible to obtain examples of noise without speech.

cid, learning dynamic noise model, robust speech recognition, (5 more...)

Neural Information Processing Systems

Apr-6-2023, 16:46:54 GMT

Conferences Web Page

Add feedback

Country:
- North America > United States > New York > New York County > New York City (0.07)

Technology:
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.96)