Whispering in Norwegian: Navigating Orthographic and Dialectic Challenges
Per E. Kummervold, Javier de la Rosa, Freddy Wetjen, Rolv-Arild Braaten, Per Erik Solberg
arXiv.org Artificial Intelligence
This article introduces NB-Whisper, an adaptation of OpenAI's Whisper fine-tuned for Norwegian Automatic Speech Recognition (ASR). We highlight its key contributions and summarise the results achieved in transcribing spoken Norwegian into written form and in translating other languages into Norwegian. We show that, relative to OpenAI Whisper Large-v3, the word error rate (WER) for Norwegian Bokmål transcription improves from 10.4 to 6.6 on the Fleurs dataset and from 6.8 to 2.2 on the NST dataset.
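For readers unfamiliar with the metric, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the system output, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the evaluation code used by the authors. The function names and the example sentences are illustrative assumptions, and the sketch applies no text normalisation (casing, punctuation), which a real evaluation may well do.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(before: float, after: float) -> float:
    """Fraction by which an error rate dropped, e.g. 0.25 for a 25% drop."""
    return (before - after) / before


# Hypothetical example: two of four reference words are substituted.
example_wer = word_error_rate("jeg bor i oslo", "jeg bodde i bergen")

# Relative WER reductions implied by the figures quoted in the abstract.
fleurs_reduction = relative_reduction(10.4, 6.6)
nst_reduction = relative_reduction(6.8, 2.2)
```

Applied to the reported figures, `relative_reduction` gives roughly a 37% relative reduction on Fleurs and roughly 68% on NST, assuming both pairs of numbers are WER values on the same scale.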
large language model, machine learning, navigating orthographic and dialectic challenge
Feb-2-2024