Diving Deep into Context-Aware Neural Machine Translation
Huo, Jingjing, Herold, Christian, Gao, Yingbo, Dahlmann, Leonard, Khadivi, Shahram, Ney, Hermann
–arXiv.org Artificial Intelligence
Context-aware neural machine translation (NMT) is a promising direction to improve the translation quality by making use of the additional context, e.g., document-level translation, or having meta-information. Although there exist various architectures and analyses, the effectiveness of different context-aware NMT models is not well explored yet. This paper analyzes the performance of document-level NMT models on four diverse domains with a varied amount of parallel document-level bilingual data. We conduct a comprehensive set of experiments to investigate the impact of document-level NMT. We find that there is no single best approach to document-level NMT, but rather that different architectures come out on top on different tasks. Looking at task-specific problems, such as pronoun resolution or headline translation, we find improvements in the context-aware systems, even in cases where the corpus-level metrics like BLEU show no significant improvement. We also show that document-level back-translation significantly helps to compensate for the lack of document-level bi-texts.
arXiv.org Artificial Intelligence
Oct-19-2020
- Country:
- North America > United States
- California > San Diego County > San Diego (0.04)
- Europe
- France (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany > North Rhine-Westphalia
- Cologne Region > Aachen (0.04)
- Asia > China
- Hong Kong (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.54)
- Technology: