SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
Khan, Abdul Rafae, Kanade, Hrishikesh, Budhrani, Girish Amar, Jhanglani, Preet, Xu, Jia
–arXiv.org Artificial Intelligence
This paper describes the Stevens Institute of Technology's submission for the WMT 2022 Shared Task: Code-mixed Machine Translation (MixMT). The task consisted of two subtasks, subtask $1$ Hindi/English to Hinglish and subtask $2$ Hinglish to English translation. Our findings lie in the improvements made through the use of large pre-trained multilingual NMT models and in-domain datasets, as well as back-translation and ensemble techniques. The translation output is automatically evaluated against the reference translations using ROUGE-L and WER. Our system achieves the $1^{st}$ position on subtask $2$ according to ROUGE-L, WER, and human evaluation, $1^{st}$ position on subtask $1$ according to WER and human evaluation, and $3^{rd}$ position on subtask $1$ with respect to ROUGE-L metric.
arXiv.org Artificial Intelligence
Nov-16-2022
- Country:
- North America
- Dominican Republic (0.04)
- United States > Texas
- Dallas County > Dallas (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > England
- Asia > Indonesia
- Bali (0.04)
- North America
- Genre:
- Research Report (0.70)
- Technology: