An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution

Lo, Tien-Hong, Chao, Fu-An, Wu, Tzu-I, Sung, Yao-Ting, Chen, Berlin

Apr-11-2024–arXiv.org Artificial Intelligence

Automated speaking assessment (ASA) typically involves automatic speech recognition (ASR) and hand-crafted feature extraction from the ASR transcript of a learner's speech. Recently, self-supervised learning (SSL) has shown stellar performance compared to traditional methods. However, SSL-based ASA systems are faced with at least three data-related challenges: limited annotated data, uneven distribution of learner proficiency levels and non-uniform score intervals between different CEFR proficiency levels. To address these challenges, we explore the use of two novel modeling strategies: metric-based classification and loss reweighting, leveraging distinct SSL-based embedding features. Extensive experimental results on the ICNALE benchmark dataset suggest that our approach can outperform existing strong baselines by a sizable margin, achieving a significant improvement of more than 10% in CEFR prediction accuracy.

assessment, classifier, proceedings, (14 more...)

arXiv.org Artificial Intelligence

Apr-11-2024

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia
  - Taiwan (0.05)
  - China > Hong Kong (0.04)
  - Singapore (0.04)
  - Philippines (0.04)
  - Thailand (0.04)
  - Indonesia (0.04)
  - Japan (0.04)
  - Pakistan (0.04)
  - South Korea (0.04)

Genre:
- Research Report (0.64)

Industry:
- Education > Educational Technology > Educational Software (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Speech > Speech Recognition (0.68)
  - Machine Learning > Neural Networks (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found