Out of the Box, into the Clinic? Evaluating State-of-the-Art ASR for Clinical Applications for Older Adults
van Dijk, Bram, Kuiper, Tiberon, Ahmed, Sirin Aoulad si, Levebvre, Armel, Johnson, Jake, Duin, Jan, Mooijaart, Simon, Spruit, Marco
–arXiv.org Artificial Intelligence
Voice-controlled interfaces can support older adults in clinical contexts -- with chatbots being a prime example -- but reliable Automatic Speech Recognition (ASR) for underrepresented groups remains a bottleneck. This study evaluates state-of-the-art ASR models on language use of older Dutch adults, who interacted with the Welzijn.AI chatbot designed for geriatric contexts. We benchmark generic multilingual ASR models, and models fine-tuned for Dutch spoken by older adults, while also considering processing speed. Our results show that generic multilingual models outperform fine-tuned models, which suggests recent ASR models can generalise well out of the box to real-world datasets. Moreover, our results indicate that truncating generic models is helpful in balancing the accuracy-speed trade-off. Nonetheless, we also find inputs which cause a high word error rate and place them in context.
arXiv.org Artificial Intelligence
Oct-2-2025
- Country:
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Netherlands
- North Holland > Amsterdam (0.05)
- South Holland > Leiden (0.05)
- France > Provence-Alpes-Côte d'Azur
- Europe
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: