On-the-Fly Fusion of Large Language Models and Machine Translation

Hoang, Hieu, Khayrallah, Huda, Junczys-Dowmunt, Marcin

May-6-2024–arXiv.org Artificial Intelligence

We propose on-the-fly ensembling of a machine translation model with an LLM that is prompted on the same task and input. We perform experiments on four language pairs (in both directions) with varying amounts of data. We find that an LLM that is slightly weaker at translation can improve the translations of an NMT model, and that ensembling with an LLM can produce better translations than ensembling two stronger MT models. We combine our method with various techniques from LLM prompting, such as in-context learning and translation context.
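To make the idea of on-the-fly ensembling concrete, the following is a minimal sketch, not the authors' implementation. It assumes that both models share a vocabulary, that each exposes a next-token probability distribution given the source and the target prefix, and that the distributions are combined by weighted linear interpolation at every decoding step. The abstract does not specify the combination rule, and all names here (`ensemble_step`, `greedy_ensemble_decode`, `toy_mt`, `toy_llm`) are hypothetical.

```python
from typing import Callable, Dict, List

# Assumption: a "model" is any function mapping (source, target prefix)
# to a next-token probability distribution over a shared vocabulary.
NextTokenFn = Callable[[str, List[str]], Dict[str, float]]

EOS = "</s>"


def ensemble_step(dists: List[Dict[str, float]], weights: List[float]) -> Dict[str, float]:
    """Weighted linear interpolation of next-token distributions, renormalized."""
    vocab = set().union(*dists)
    mixed = {
        tok: sum(w * d.get(tok, 0.0) for d, w in zip(dists, weights))
        for tok in vocab
    }
    total = sum(mixed.values())
    return {tok: p / total for tok, p in mixed.items()}


def greedy_ensemble_decode(
    source: str,
    models: List[NextTokenFn],
    weights: List[float],
    max_len: int = 20,
) -> List[str]:
    """Greedy decoding in which every step queries all models and mixes them."""
    prefix: List[str] = []
    for _ in range(max_len):
        dists = [model(source, prefix) for model in models]
        mixed = ensemble_step(dists, weights)
        # Sorting first makes tie-breaking deterministic.
        token = max(sorted(mixed), key=mixed.get)
        if token == EOS:
            break
        prefix.append(token)
    return prefix


# Toy stand-ins for an MT model and a prompted LLM (hypothetical numbers).
def toy_mt(source: str, prefix: List[str]) -> Dict[str, float]:
    table = {0: {"the": 0.6, "a": 0.4}, 1: {"cat": 0.5, "dog": 0.5}}
    return table.get(len(prefix), {EOS: 1.0})


def toy_llm(source: str, prefix: List[str]) -> Dict[str, float]:
    table = {0: {"the": 0.5, "a": 0.5}, 1: {"cat": 0.9, "dog": 0.1}}
    return table.get(len(prefix), {EOS: 1.0})


translation = greedy_ensemble_decode("le chat", [toy_mt, toy_llm], [0.5, 0.5])
```

In this toy setup the MT model is undecided between "cat" and "dog" at the second step, and the LLM's distribution breaks the tie. A real system would also need beam search and a way to reconcile differing tokenizations, which this sketch leaves out.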

computational linguistics, proceedings, translation


Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Maryland > Baltimore (0.04)
    - Washington > King County
      - Redmond (0.04)
    - New York > New York County
      - New York City (0.04)
  - Canada > Ontario
    - Toronto (0.05)
- Europe
  - Germany > Berlin (0.05)
  - Czechia > Prague (0.04)
  - Spain > Valencian Community
    - Valencia Province > Valencia (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Finland > Pirkanmaa
    - Tampere (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Sweden > Vaestra Goetaland
    - Gothenburg (0.04)
  - Middle East > Republic of Türkiye
    - Istanbul Province > Istanbul (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - China > Hong Kong (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - Middle East
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
    - Republic of Türkiye > Istanbul Province
      - Istanbul (0.04)
  - Japan > Honshū
    - Chūbu > Aichi Prefecture > Nagoya (0.04)
  - India > Karnataka
    - Bengaluru (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Machine Translation (1.00)
    - Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.69)
