Adapting Pretrained ASR Models to Low-resource Clinical Speech using Epistemic Uncertainty-based Data Selection

Dossou, Bonaventure F. P., Tonja, Atnafu Lambebo, Emezue, Chris Chinenye, Olatunji, Tobi, Etori, Naome A, Osei, Salomey, Adewumi, Tosin, Singh, Sahib

Oct-8-2023, arXiv.org Artificial Intelligence

While there has been significant progress in automatic speech recognition (ASR), African-accented clinical ASR remains understudied due to a lack of training datasets. Building robust ASR systems in this domain requires large amounts of annotated data covering a wide variety of linguistically and morphologically rich accents, and such data is expensive to create. Our study addresses this problem by reducing annotation expenses through informative, uncertainty-based data selection. We show that incorporating epistemic uncertainty into our adaptation rounds outperforms several baselines established with state-of-the-art (SOTA) ASR models, while reducing the amount of labeled data required and hence annotation costs. Our approach also improves out-of-distribution generalization for very low-resource accents, demonstrating its viability for building generalizable ASR models for accented African clinical speech, where training datasets are scarce.
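
The abstract does not say how epistemic uncertainty is estimated or how samples are ranked, so the following is only a minimal sketch of uncertainty-based data selection under stated assumptions. It assumes that each unlabeled utterance has been scored several times by stochastic forward passes of the same model (for example, with dropout kept active), and that the variance of those scores serves as a proxy for epistemic uncertainty. The function names, the `pool` dictionary, and the example scores are hypothetical and are not taken from the paper.

```python
from statistics import pvariance


def epistemic_uncertainty(pass_scores):
    """Population variance of one utterance's score across stochastic passes.

    Assumption: disagreement between passes is used as a proxy for
    epistemic uncertainty; the paper's actual estimator may differ.
    """
    return pvariance(pass_scores)


def select_most_uncertain(scores_by_utterance, k):
    """Return the ids of the k utterances with the highest variance."""
    ranked = sorted(
        scores_by_utterance,
        key=lambda utt: epistemic_uncertainty(scores_by_utterance[utt]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical per-pass scores (e.g. an error rate) for three utterances.
pool = {
    "utt_a": [0.10, 0.12, 0.11],
    "utt_b": [0.05, 0.60, 0.30],
    "utt_c": [0.40, 0.42, 0.38],
}

# Utterances the model disagrees with itself on most are sent for annotation.
selected = select_most_uncertain(pool, k=1)
print(selected)
```

In an adaptation loop of this kind, the selected utterances would be labeled and added to the training set before the next round, so that annotation effort is spent where the model is least certain rather than on the whole pool.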

Country:
- North America
  - United States
    - Minnesota (0.04)
    - California > Los Angeles County
      - Los Angeles (0.14)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Sweden > Norrbotten County
    - Luleå (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)
- Asia > Middle East
  - Republic of Türkiye (0.04)
- Africa
  - South Africa (0.04)
  - Kenya (0.04)
  - Sub-Saharan Africa (0.04)
  - Tanzania (0.04)
  - Nigeria (0.04)
  - Botswana (0.04)
  - Benin (0.04)
  - Malawi (0.04)
  - Uganda (0.04)
  - Rwanda (0.04)
  - Zimbabwe (0.04)
  - Ghana (0.04)
  - Niger (0.04)
  - Southern Africa (0.04)
  - Lesotho (0.04)

Genre:
- Research Report > Experimental Study (0.68)

Industry:
- Health & Medicine
  - Therapeutic Area > Neurology (1.00)
  - Health Care Technology (0.68)
  - Consumer Health (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)
