Adapting Pretrained ASR Models to Low-resource Clinical Speech using Epistemic Uncertainty-based Data Selection
Dossou, Bonaventure F. P., Tonja, Atnafu Lambebo, Emezue, Chris Chinenye, Olatunji, Tobi, Etori, Naome A, Osei, Salomey, Adewumi, Tosin, Singh, Sahib
–arXiv.org Artificial Intelligence
While there has been significant progress in ASR, African-accented clinical ASR has been understudied due to a lack of training datasets. Building robust ASR systems in this domain requires large amounts of annotated or labeled data, for a wide variety of linguistically and morphologically rich accents, which are expensive to create. Our study aims to address this problem by reducing annotation expenses through informative uncertainty-based data selection. We show that incorporating epistemic uncertainty into our adaptation rounds outperforms several baseline results, established using state-of-the-art (SOTA) ASR models, while reducing the required amount of labeled data, and hence reducing annotation costs. Our approach also improves out-of-distribution generalization for very low-resource accents, demonstrating the viability of our approach for building generalizable ASR models in the context of accented African clinical ASR, where training datasets are predominantly scarce.
arXiv.org Artificial Intelligence
Oct-8-2023
- Country:
- Africa (1.00)
- North America
- Canada > Quebec (0.14)
- United States (0.46)
- Genre:
- Research Report > Experimental Study (0.68)
- Industry:
- Health & Medicine
- Consumer Health (0.68)
- Health Care Technology (0.68)
- Therapeutic Area > Neurology (1.00)
- Health & Medicine
- Technology: