Improving Rare Word Translation With Dictionaries and Attention Masking
Sible, Kenneth J., Chiang, David
–arXiv.org Artificial Intelligence
In machine translation, rare words continue to be a problem for the dominant encoder-decoder architecture, especially in low-resource and out-of-domain translation settings. Human translators solve this problem with monolingual or bilingual dictionaries. In this paper, we propose appending definitions from a bilingual dictionary to source sentences and using attention masking to link together rare words with their definitions. We find that including definitions for rare words improves performance by up to 1.0 BLEU and 1.6 MacroF1.
arXiv.org Artificial Intelligence
Sep-3-2024
- Country:
- North America
- United States
- Texas > Travis County
- Austin (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Texas > Travis County
- Canada > British Columbia
- United States
- Europe
- Czechia > Prague (0.04)
- Belgium (0.04)
- Italy > Tuscany
- Florence (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- North America
- Genre:
- Research Report (0.82)
- Technology: