Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition
Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays
–arXiv.org Artificial Intelligence
Fast contextual adaptation has been shown to be effective in improving Automatic Speech Recognition (ASR) of rare words, and when combined with on-device personalized training, it can yield even better recognition results. However, traditional re-scoring approaches based on an external language model are prone to diverge during personalized training. In this work, we introduce a model-based end-to-end contextual adaptation approach that is decoder-agnostic and amenable to on-device personalization. Our on-device simulation experiments demonstrate that the proposed approach outperforms the traditional re-scoring technique by 12% relative WER and 15.7% entity mention specific F1-score in a continuous personalization scenario.
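The headline result is stated as a relative word error rate (WER) improvement. As a reminder of how such a figure is typically computed, here is a minimal sketch assuming the standard word-level edit-distance definition of WER. The function names and the sample sentences and numbers are illustrative assumptions, not taken from the paper or its evaluation code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_reduction(baseline_wer: float, proposed_wer: float) -> float:
    """Fraction by which the proposed system lowers WER relative to the baseline."""
    return (baseline_wer - proposed_wer) / baseline_wer


# Hypothetical example: a rare name misrecognized as a common word.
sample_wer = word_error_rate("call tsendsuren now", "call sender now")

# Hypothetical WER values (in percent) chosen only to illustrate a 12% relative gain.
sample_gain = relative_wer_reduction(10.0, 8.8)
```

Under this definition, a 12% relative improvement means the proposed system's WER is 0.88 times the baseline WER, regardless of the absolute values involved.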
Oct-6-2021
- Country:
- North America > United States (0.14)
- Genre:
- Research Report (0.64)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (0.65)
- Machine Learning (1.00)
- Natural Language (1.00)
- Speech > Speech Recognition (1.00)