Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Liu, Qianying, Gong, Zhuo, Yang, Zhengdong, Yang, Yuhang, Li, Sheng, Ding, Chenchen, Minematsu, Nobuaki, Huang, Hao, Cheng, Fei, Chu, Chenhui, Kurohashi, Sadao

Apr-30-2023–arXiv.org Artificial Intelligence

Low-resource speech recognition has long suffered from insufficient training data. In this paper, we propose an approach that leverages neighboring languages to improve performance in low-resource scenarios. It is founded on the hypothesis that similar linguistic units in neighboring languages exhibit comparable term frequency distributions, which enables us to construct a Huffman tree for multilingual hierarchical Softmax decoding. This hierarchical structure enables cross-lingual knowledge sharing among similar tokens, thereby improving low-resource training outcomes. Empirical analyses demonstrate that our method improves both the accuracy and the efficiency of low-resource speech recognition.
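To make the central idea concrete, the following is a minimal sketch of a Huffman tree built from token frequencies and used for hierarchical Softmax. It is not the authors' implementation: the function names (`build_huffman_codes`, `token_probability`), the toy token counts, and the choice to pool counts across languages into a single table are all assumptions made for illustration. Each token's probability is the product of binary sigmoid decisions along its path from the root, so frequent tokens need fewer decisions.

```python
import heapq
import itertools
import math


def build_huffman_codes(freqs):
    """Map each token to (internal node ids on its path, branch bits).

    `freqs` is a {token: count} dict; here it is assumed to hold counts
    pooled over several languages (an illustrative assumption).
    """
    tie = itertools.count()  # unique tie-breaker so nodes are never compared
    heap = [(f, next(tie), ("leaf", tok)) for tok, f in freqs.items()]
    heapq.heapify(heap)
    next_id = 0
    # Repeatedly merge the two least frequent nodes into a new internal node.
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        node = ("internal", next_id, left, right)
        next_id += 1
        heapq.heappush(heap, (f1 + f2, next(tie), node))
    root = heap[0][2]
    # Walk the tree to record the path and the 0/1 branch taken at each node.
    codes = {}
    stack = [(root, [], [])]
    while stack:
        node, path, bits = stack.pop()
        if node[0] == "leaf":
            codes[node[1]] = (path, bits)
        else:
            _, node_id, left, right = node
            stack.append((left, path + [node_id], bits + [0]))
            stack.append((right, path + [node_id], bits + [1]))
    return codes


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def token_probability(hidden, node_vectors, path, bits):
    """Probability of one token: product of binary decisions along its path."""
    prob = 1.0
    for node_id, bit in zip(path, bits):
        score = sum(h * w for h, w in zip(hidden, node_vectors[node_id]))
        p_right = sigmoid(score)  # probability of taking the branch labeled 1
        prob *= p_right if bit == 1 else 1.0 - p_right
    return prob


# Toy usage with invented counts and invented node parameters.
toy_freqs = {"a": 5, "b": 2, "c": 1, "d": 1}
toy_codes = build_huffman_codes(toy_freqs)
toy_nodes = {0: [0.1, -0.2], 1: [0.3, 0.4], 2: [-0.5, 0.2]}
toy_hidden = [1.0, 2.0]
toy_total = sum(
    token_probability(toy_hidden, toy_nodes, path, bits)
    for path, bits in toy_codes.values()
)
```

In this sketch the probabilities over all tokens sum to one by construction, because every internal node splits its probability mass between its two children. In a trained model the per-node vectors would be learned parameters and `hidden` would come from the speech encoder or decoder; both are placeholders here.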

artificial intelligence, machine learning, recognition

Country:
- Europe
  - Portugal (0.04)
  - Finland (0.04)
- Asia
  - China (0.04)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture
      - Tokyo (0.14)
    - Kansai > Kyoto Prefecture
      - Kyoto (0.05)
  - Indonesia
    - Sumatra > Aceh (0.04)
    - Sulawesi > South Sulawesi (0.04)
- Africa
  - Liberia (0.04)
  - Cabo Verde (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Machine Learning (1.00)
