Exploring Generative Error Correction for Dysarthric Speech Recognition

La Quatra, Moreno, Koudounas, Alkis, Salerno, Valerio Mario, Siniscalchi, Sabato Marco

May-27-2025–arXiv.org Artificial Intelligence

Despite the remarkable progress in end-to-end Automatic Speech Recognition (ASR) engines, accurately transcribing dysarthric speech remains a major challenge. In this work, we proposed a two-stage framework for the Speech Accessibility Project Challenge at INTERSPEECH 2025, which combines cutting-edge speech recognition models with LLM-based generative error correction (GER). We assess different configurations of model scales and training strategies, incorporating specific hypothesis selection to improve transcription accuracy. Experiments on the Speech Accessibility Project dataset demonstrate the strength of our approach on structured and spontaneous speech, while highlighting challenges in single-word recognition.

artificial intelligence, hypothesis, speech recognition, (12 more...)

arXiv.org Artificial Intelligence

May-27-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Italy (0.28)

Genre:
- Research Report (0.51)

Industry:
- Health & Medicine (0.48)

Technology:
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found