Multilingual acoustic word embeddings for zero-resource languages

Jan-23-2024–arXiv.org Artificial Intelligence

This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data. It specifically uses acoustic word embedding (AWE) -- fixed-dimensional representations of variable-duration speech segments -- employing multilingual transfer, where labelled data from several well-resourced languages are used for pertaining. The study introduces a new neural network that outperforms existing AWE models on zero-resource languages. It explores the impact of the choice of well-resourced languages. AWEs are applied to a keyword-spotting system for hate speech detection in Swahili radio broadcasts, demonstrating robustness in real-world scenarios. Additionally, novel semantic AWE models improve semantic query-by-example search.

awe model, multilingual model, zero-resource language, (17 more...)

arXiv.org Artificial Intelligence

Jan-23-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Africa
  - South Africa (0.14)
  - Sub-Saharan Africa (0.04)
  - Rwanda (0.04)
  - Uganda (0.04)
  - Senegal (0.04)
  - Kenya > Nairobi City County
    - Nairobi (0.04)

Genre:
- Research Report > New Finding (1.00)
- Overview (0.92)

Industry:
- Leisure & Entertainment > Sports (1.00)
- Media > Radio (0.87)

Technology:
- Information Technology
  - Information Management > Search (1.00)
  - Data Science (1.00)
  - Communications (1.00)
  - Artificial Intelligence
    - Speech > Speech Recognition (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language > Text Processing (1.00)
    - Machine Learning
      - Statistical Learning (1.00)
      - Neural Networks > Deep Learning (1.00)