Data-Efficient ASR Personalization for Non-Normative Speech Using an Uncertainty-Based Phoneme Difficulty Score for Guided Sampling
Pokel, Niclas, Moure, Pehuén, Boehringer, Roman, Gao, Yingqiang
–arXiv.org Artificial Intelligence
Automatic speech recognition (ASR) systems struggle with non-normative speech from individuals with impairments caused by conditions like cerebral palsy or structural anomalies. The high acoustic variability and scarcity of training data severely degrade model performance. This work introduces a data-efficient personalization method that quantifies phoneme-level uncertainty to guide fine-tuning. We leverage Monte Carlo Dropout to estimate which phonemes a model finds most difficult and use these estimates for a targeted oversampling strategy. We validate our method on English and German datasets. Crucially, we demonstrate that our model-derived uncertainty strongly correlates with phonemes identified as challenging in an expert clinical logopedic report, marking, to our knowledge, the first work to successfully align model uncertainty with expert assessment of speech difficulty. Our results show that this clinically-validated, uncertainty-guided sampling significantly improves ASR accuracy, delivering a practical framework for personalized and inclusive ASR.
arXiv.org Artificial Intelligence
Sep-26-2025
- Country:
- Asia
- China > Shanghai
- Shanghai (0.05)
- Middle East > Jordan (0.04)
- China > Shanghai
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Greece (0.04)
- Switzerland > Zürich
- Zürich (0.15)
- France > Provence-Alpes-Côte d'Azur
- North America > Canada
- South America > Paraguay
- Asia
- Genre:
- Research Report > New Finding (0.54)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.48)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language (0.94)
- Speech > Speech Recognition (0.92)
- Information Technology > Artificial Intelligence