Investigating Length Issues in Document-level Machine Translation
Ziqian Peng, Rachel Bawden, François Yvon
arXiv.org Artificial Intelligence
Transformer architectures are increasingly effective at processing and generating very long chunks of text, opening new perspectives for document-level machine translation (MT). In this work, we challenge the ability of MT systems to handle texts comprising up to several thousand tokens. We design and implement a new approach to precisely measure the effect of length increments on MT outputs. Our experiments with two representative architectures unambiguously show that (a) translation performance decreases with the length of the input text; and (b) the position of sentences within the document matters: translation quality is higher for sentences occurring earlier in a document. We further show that manipulating the distribution of document lengths and of positional embeddings only marginally mitigates these problems. Our results suggest that even though document-level MT is computationally feasible, it does not yet match the performance of sentence-based MT.
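The position-dependent degradation reported in the abstract can be probed with off-the-shelf tools: score each translated sentence individually, then aggregate the scores by the sentence's position within its document. The sketch below is not the authors' code; the use of sacrebleu's sentence-level BLEU and the simple position-index bucketing are assumptions made for illustration.

```python
# Hypothetical sketch: bucket sentence-level BLEU by the position of each
# sentence within its document, to check whether quality drops for
# sentences that occur later in long inputs.
from collections import defaultdict
from sacrebleu.metrics import BLEU

def score_by_position(docs):
    """docs: list of documents, each a list of (hypothesis, reference)
    string pairs in document order. Returns mean sentence BLEU per
    position index."""
    # effective_order avoids zero scores on very short sentences
    bleu = BLEU(effective_order=True)
    totals, counts = defaultdict(float), defaultdict(int)
    for doc in docs:
        for pos, (hyp, ref) in enumerate(doc):
            totals[pos] += bleu.sentence_score(hyp, [ref]).score
            counts[pos] += 1
    return {pos: totals[pos] / counts[pos] for pos in sorted(totals)}

# Toy example: two short "documents" of (hypothesis, reference) pairs.
docs = [
    [("the cat sat on the mat", "the cat sat on the mat"),
     ("it was raining outside", "it rained outside")],
    [("translation quality is high", "translation quality is high"),
     ("later sentences may degrade", "later sentences can degrade")],
]
print(score_by_position(docs))  # e.g. {0: ..., 1: ...}
```

A downward trend in the per-position averages over many documents would be consistent with finding (b); a neural metric such as COMET could be substituted for BLEU in the same bucketing scheme.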
Dec-23-2024