Pivot Language for Low-Resource Machine Translation
Abhimanyu Talwar, Julien Laasri
arXiv.org Artificial Intelligence
Certain language pairs lack a parallel corpus that is both large and diverse in domain. One way to overcome this is to use a pivot language. In this paper we use Hindi as a pivot language to translate Nepali into English. We describe what makes Hindi a good candidate for the pivot. We discuss ways in which a pivot language can be used, and apply two such approaches, the Transfer Method (fully supervised) and Backtranslation (semi-supervised), to translate Nepali into English. Using the former, we achieve a devtest set SacreBLEU score of 14.2, which improves on the fully supervised baseline score reported by Guzmán et al. (2019) by 6.6 points. While we fall slightly below the semi-supervised baseline score of 15.1, we discuss what may have caused this under-performance and suggest scope for future work.
May-22-2025