An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers

Ranasinghe, Tharindu, Orasan, Constantin, Mitkov, Ruslan

May-31-2021–arXiv.org Artificial Intelligence

Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models. The obvious disadvantages of these approaches are the need for labelled data for each language pair and the high cost required to maintain several language-specific models. To overcome these problems, we explore different approaches to multilingual, word-level QE. We show that these QE models perform on par with the current language-specific models. In the cases of zero-shot and few-shot QE, we demonstrate that it is possible to accurately predict word-level quality for any given new language pair from models trained on other language pairs. Our findings suggest that the word-level QE models based on powerful pre-trained transformers that we propose in this paper generalise well across languages, making them more useful in real-world scenarios.

computational linguistic, language pair, qe model, (10 more...)

arXiv.org Artificial Intelligence

May-31-2021

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New Mexico > Santa Fe County
    - Santa Fe (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - California > San Diego County
    - San Diego (0.04)
- Europe
  - Slovenia (0.04)
  - Belgium (0.04)
  - United Kingdom > England
    - West Midlands > Wolverhampton (0.04)
    - Surrey (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Research Report > New Finding (0.86)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Machine Translation (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found