MunTTS: A Text-to-Speech System for Mundari

Gumma, Varun, Hada, Rishav, Yadavalli, Aditya, Gogoi, Pamir, Mondal, Ishani, Seshadri, Vivek, Bali, Kalika

Jan-28-2024–arXiv.org Artificial Intelligence

We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austo-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to build a speech synthesis system. We begin our study by gathering a substantial dataset of Mundari text and speech and train end-to-end speech models. We also delve into the methods used for training our models, ensuring they are efficient and effective despite the data constraints. We evaluate our system with native speakers and objective metrics, demonstrating its potential as a tool for preserving and promoting the Mundari language in the digital age.

international conference, mundari, speech, (17 more...)

arXiv.org Artificial Intelligence

Jan-28-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Maryland (0.04)
- Europe
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia
  - Indonesia > Bali (0.05)
  - India
    - Jharkhand (0.04)
    - West Bengal > Kharagpur (0.04)

Genre:
- Research Report (0.50)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Speech > Speech Synthesis (0.95)
  - Machine Learning > Neural Networks (0.94)
  - Vision > Optical Character Recognition (0.63)