Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces
Kuhn, Korbinian; Kersken, Verena; Zimmermann, Gottfried
arXiv.org Artificial Intelligence
Despite advances in Automatic Speech Recognition (ASR), transcription errors persist and require manual correction. Confidence scores, which indicate the certainty of ASR results, could assist users in identifying and correcting errors. This study evaluates the reliability of confidence scores for error detection through a comprehensive analysis of end-to-end ASR models and a user study with 36 participants. The results show that while confidence scores correlate with transcription accuracy, their error detection performance is limited. Classifiers frequently miss errors or generate many false positives, undermining their practical utility. Confidence-based error detection neither improved correction efficiency nor was perceived as helpful by participants. These findings highlight the limitations of confidence scores and the need for more sophisticated approaches to improve user interaction and explainability of ASR results.
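To make the trade-off between missed errors and false positives concrete, the following is a minimal sketch of threshold-based error detection on word-level confidence scores. It is not the classifier or the data used in the study: the `Word` type, the helper functions, the threshold values, and the five example words are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    confidence: float  # hypothetical word-level ASR confidence in [0, 1]
    is_error: bool     # ground truth, e.g. from alignment with a reference transcript


def flag_errors(words, threshold):
    """Flag every word whose confidence falls below the threshold."""
    return [w.confidence < threshold for w in words]


def precision_recall(words, flags):
    """Compute precision and recall of the flags against the ground truth."""
    tp = sum(1 for w, f in zip(words, flags) if f and w.is_error)
    fp = sum(1 for w, f in zip(words, flags) if f and not w.is_error)
    fn = sum(1 for w, f in zip(words, flags) if not f and w.is_error)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Made-up example: one error has low confidence, one error is confidently wrong,
# and one correct word has low confidence.
words = [
    Word("the", 0.98, False),
    Word("whether", 0.55, True),
    Word("is", 0.60, False),
    Word("nice", 0.95, True),
    Word("today", 0.90, False),
]

for threshold in (0.7, 0.97):
    flags = flag_errors(words, threshold)
    precision, recall = precision_recall(words, flags)
    print(f"threshold={threshold}: precision={precision:.2f}, recall={recall:.2f}")
```

In this toy data, the lower threshold misses the confidently wrong word, while the higher threshold catches both errors only by also flagging more correct words. This mirrors, in miniature, the kind of limitation the abstract describes.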
Mar-19-2025
- Country:
  - Oceania
    - New Zealand (0.04)
    - Australia
      - New South Wales > Sydney (0.04)
      - Queensland
        - Cairns Region > Cairns (0.04)
        - Brisbane (0.04)
  - North America
    - United States
      - Maryland > Baltimore (0.14)
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - New York > New York County
        - New York City (0.05)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - California
        - San Francisco County > San Francisco (0.14)
        - Los Angeles County > Long Beach (0.04)
    - Canada
      - Quebec > Montreal (0.05)
      - Ontario > Toronto (0.04)
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.04)
      - Alberta > Census Division No. 6
        - Calgary Metropolitan Region > Calgary (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - Greece (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - Germany
      - Saxony > Dresden (0.04)
      - Bavaria > Upper Bavaria
        - Munich (0.04)
      - Baden-Württemberg > Stuttgart Region
        - Stuttgart (0.05)
    - Portugal > Lisbon
      - Lisbon (0.04)
    - Czechia > South Moravian Region
      - Brno (0.04)
    - Slovenia > Drava
      - Municipality of Benedikt > Benedikt (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - France > Auvergne-Rhône-Alpes
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - United Kingdom
      - Scotland > City of Glasgow
        - Glasgow (0.04)
      - England > East Sussex
        - Brighton (0.04)
  - Asia
    - South Korea > Incheon
      - Incheon (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Japan
      - Kyūshū & Okinawa > Kyūshū
        - Miyazaki Prefecture > Miyazaki (0.04)
      - Honshū > Kantō
        - Kanagawa Prefecture > Yokohama (0.05)
    - China
- Genre:
  - Research Report
    - New Finding (1.00)
    - Experimental Study (0.93)
- Industry:
  - Health & Medicine (0.47)
- Technology: