Multilingual acoustic word embeddings for zero-resource languages
arXiv.org Artificial Intelligence
This research addresses the challenge of developing speech applications for zero-resource languages, which lack labelled data. It uses acoustic word embeddings (AWEs), fixed-dimensional representations of variable-duration speech segments, and relies on multilingual transfer, where labelled data from several well-resourced languages are used for pretraining. The study introduces a new neural network that outperforms existing AWE models on zero-resource languages. It also explores the impact of the choice of well-resourced languages. AWEs are applied to a keyword-spotting system for hate speech detection in Swahili radio broadcasts, demonstrating robustness in real-world scenarios. Additionally, novel semantic AWE models improve semantic query-by-example search.
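To make the idea of a fixed-dimensional embedding concrete, here is a minimal sketch in plain Python. It is not the neural model from the study: a simple downsampling scheme stands in for the network, and the function names, the toy two-dimensional "frames", and the segment labels are all assumptions made for illustration. It shows how segments of different durations can be mapped to vectors of the same size and then ranked against a spoken query by cosine similarity, which is the basic shape of query-by-example search.

```python
import math

def downsample_embed(frames, k=3):
    """Map a variable-length list of feature frames (each a list of D floats)
    to a fixed-size vector of length k*D by concatenating k evenly spaced frames.
    Assumption: a non-neural stand-in for a learned AWE model."""
    t = len(frames)
    if t == 0:
        raise ValueError("segment has no frames")
    if k < 2:
        raise ValueError("k must be at least 2")
    # Floor-spaced indices from the first frame to the last frame.
    idx = [(i * (t - 1)) // (k - 1) for i in range(k)]
    return [x for i in idx for x in frames[i]]

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def query_by_example(query, segments, k=3):
    """Rank named segments by similarity to the query, most similar first."""
    q = downsample_embed(query, k)
    scored = [
        (cosine_similarity(q, downsample_embed(frames, k)), name)
        for name, frames in segments.items()
    ]
    return sorted(scored, reverse=True)

# Hypothetical toy data: 2-dimensional frames, segments of different durations.
query = [[1, 0], [1, 0], [0, 1], [0, 1]]                    # 4 frames
segments = {
    "same_word_slower": [[1, 0]] * 4 + [[0, 1]] * 4,        # 8 frames
    "different_word":   [[0, 1]] * 3 + [[1, 0]] * 2,        # 5 frames
}

ranking = query_by_example(query, segments)
for score, name in ranking:
    print(f"{name}: {score:.2f}")
```

Both candidate segments and the query end up as 6-dimensional vectors despite having 8, 5, and 4 frames, so a single vector comparison replaces frame-by-frame alignment. A learned AWE model would replace `downsample_embed` while the ranking step stays the same.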
Jan-23-2024
- Country:
- North America
- United States (0.04)
- Canada > Quebec
- Montreal (0.04)
- Africa
- South Africa (0.14)
- Sub-Saharan Africa (0.04)
- Rwanda (0.04)
- Uganda (0.04)
- Senegal (0.04)
- Kenya > Nairobi City County
- Nairobi (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Overview (0.92)
- Industry:
- Leisure & Entertainment > Sports (1.00)
- Media > Radio (0.87)
- Technology:
- Information Technology
- Information Management > Search (1.00)
- Data Science (1.00)
- Communications (1.00)
- Artificial Intelligence
- Speech > Speech Recognition (1.00)
- Representation & Reasoning (1.00)
- Natural Language > Text Processing (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)