Building English ASR model with regional language support

Agrawal, Purvi, Joshi, Vikas, Patidar, Bharati, Gupta, Ankur, Mehta, Rupesh Kumar

Mar-10-2025–arXiv.org Artificial Intelligence

However, using such a system much higher computation In this paper, we present a novel approach to developing an English cost than monolingual models, as both monolingual models Automatic Speech Recognition (ASR) system that can effectively must produce recognition outputs, increasing the cost of recognition handle Hindi queries, without compromising its performance for the end user. Additionally, the performance of these systems is on English. We propose a novel acoustic model (AM), referred to as sensitive to the accuracy of the LID, as the monolingual models perform SplitHead with Attention (SHA) model, features shared hidden layers well in their respective languages but suffer greatly in other across languages and language-specific projection layers combined languages. The aim of this work is not to build a truly bilingual via a self-attention mechanism. This mechanism estimates the model that can recognize both languages equally well. Instead, the weight for each language based on input data and weighs the corresponding goal is to improve the performance of the Indian English (en-IN) language-specific projection layers accordingly. Additionally, ASR model on Hindi (hi-IN) queries, bringing it reasonably close we propose a language modeling approach that interpolates to the performance of the Hindi model on Hindi queries. This is to n-gram models from both English and transliterated Hindi text corpora.

asr model, projection layer, query, (15 more...)

arXiv.org Artificial Intelligence

Mar-10-2025

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia (0.04)
- Asia > India (0.04)
- North America > Canada
  - British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Belgium
  - Brussels-Capital Region > Brussels (0.04)
- Africa > Eswatini
  - Manzini > Manzini (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found