Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Apr-11-2024–arXiv.org Artificial Intelligence

Machine Translation (MT) remains one of the last NLP tasks where large language models (LLMs) have not yet replaced dedicated supervised systems. This work exploits the complementary strengths of LLMs and supervised MT by guiding LLMs to automatically post-edit MT with external feedback on its quality, derived from Multidimensional Quality Metric (MQM) annotations. Working with LLaMA-2 models, we consider prompting strategies varying the nature of feedback provided and then fine-tune the LLM to improve its ability to exploit the provided guidance. Through experiments on Chinese-English, English-German, and English-Russian MQM data, we demonstrate that prompting LLMs to post-edit MT improves TER, BLEU and COMET scores, although the benefits of fine-grained feedback are not clear. Fine-tuning helps integrate fine-grained feedback more effectively and further improves translation quality based on both automatic and human evaluation.

annotation, language pair, translation, (15 more...)

arXiv.org Artificial Intelligence

Apr-11-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Maryland (0.04)
  - Washington > King County
    - Seattle (0.04)
  - New York > Monroe County
    - Rochester (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe
  - Germany > Berlin (0.04)
  - Middle East > Malta
    - Eastern Region > Northern Harbour District > St. Julian's (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Finland > Pirkanmaa
    - Tampere (0.04)
  - Bulgaria > Sofia City Province
    - Sofia (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.05)
  - Indonesia > Bali (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Machine Translation (1.00)
    - Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found