Can Domains Be Transferred Across Languages in Multi-Domain Multilingual Neural Machine Translation?
Vu, Thuy-Trang, Khadivi, Shahram, He, Xuanli, Phung, Dinh, Haffari, Gholamreza
–arXiv.org Artificial Intelligence
Previous works mostly focus on either multilingual or multi-domain aspects of neural machine translation (NMT). This paper investigates whether the domain information can be transferred across languages on the composition of multi-domain and multilingual NMT, particularly for the incomplete data condition where in-domain bitext is missing for some language pairs. Our results in the curated leave-one-domain-out experiments show that multi-domain multilingual (MDML) NMT can boost zero-shot translation performance up to +10 gains on BLEU, as well as aid the generalisation of multi-domain NMT to the missing domain. We also explore strategies for effective integration of multilingual and multi-domain NMT, including language and domain tag combination and auxiliary task training. We find that learning domain-aware representations and adding target-language tags to the encoder leads to effective MDML-NMT.
arXiv.org Artificial Intelligence
Oct-20-2022
- Country:
- Oceania > Australia (0.04)
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Diego County
- San Diego (0.04)
- Minnesota > Hennepin County
- Europe
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Spain > Valencian Community
- Asia
- Genre:
- Research Report > New Finding (0.87)
- Industry:
- Government (0.46)
- Technology: