An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers
Ranasinghe, Tharindu, Orasan, Constantin, Mitkov, Ruslan
–arXiv.org Artificial Intelligence
Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models. The obvious disadvantages of these approaches are the need for labelled data for each language pair and the high cost required to maintain several language-specific models. To overcome these problems, we explore different approaches to multilingual, word-level QE. We show that these QE models perform on par with the current language-specific models. In the cases of zero-shot and few-shot QE, we demonstrate that it is possible to accurately predict word-level quality for any given new language pair from models trained on other language pairs. Our findings suggest that the word-level QE models based on powerful pre-trained transformers that we propose in this paper generalise well across languages, making them more useful in real-world scenarios.
arXiv.org Artificial Intelligence
May-31-2021
- Country:
- North America > United States
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Diego County
- San Diego (0.04)
- New Mexico > Santa Fe County
- Europe
- Slovenia (0.04)
- Belgium (0.04)
- United Kingdom > England
- West Midlands > Wolverhampton (0.04)
- Surrey (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy > Tuscany
- Florence (0.04)
- Asia
- North America > United States
- Genre:
- Research Report > New Finding (0.86)
- Technology: