COMET-QE and Active Learning for Low-Resource Machine Translation
Chimoto, Everlyn Asiko, Bassett, Bruce A.
–arXiv.org Artificial Intelligence
Active learning aims to deliver maximum benefit when resources are scarce. We use COMET-QE, a reference-free evaluation metric, to select sentences for low-resource neural machine translation. Using Swahili, Kinyarwanda and Spanish for our experiments, we show that COMET-QE significantly outperforms two variants of Round Trip Translation Likelihood (RTTL) and random sentence selection by up to 5 BLEU points for 20k sentences selected by Active Learning on a 30k baseline. This suggests that COMET-QE is a powerful tool for sentence selection in the very low-resource limit.
arXiv.org Artificial Intelligence
Oct-27-2022
- Country:
- North America > United States
- Pennsylvania (0.04)
- New York (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- Europe
- Germany > Berlin (0.04)
- Sweden > Uppsala County
- Uppsala (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Italy > Tuscany
- Florence (0.04)
- Greece > Attica
- Athens (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > China
- Hong Kong (0.05)
- Africa > South Africa
- Western Cape > Cape Town (0.05)
- North America > United States
- Genre:
- Research Report (0.64)
- Technology: