Towards interfacing large language models with ASR systems using confidence measures and prompting

Naderi, Maryam, Hermann, Enno, Nanchen, Alexandre, Hovsepyan, Sevada, -Doss, Mathew Magimai.

Jul-31-2024–arXiv.org Artificial Intelligence

As large language models (LLMs) grow in parameter size and capabilities, such as interaction through prompting, they open up new ways of interfacing with automatic speech recognition (ASR) systems beyond rescoring n-best lists. This work investigates post-hoc correction of ASR transcripts with LLMs. To avoid introducing errors into likely accurate transcripts, we propose a range of confidence-based filtering methods. Our results indicate that this can improve the performance of less competitive ASR systems.

correction, llm, transcription, (14 more...)

arXiv.org Artificial Intelligence

Jul-31-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Switzerland (0.04)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Health & Medicine (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.33)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found