On-the-Fly Fusion of Large Language Models and Machine Translation
Hieu Hoang, Huda Khayrallah, Marcin Junczys-Dowmunt
arXiv.org Artificial Intelligence
We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the same task and input. We perform experiments on four language pairs (both directions) with varying amounts of data. We find that an LLM that is slightly weaker at translation can improve the translations of an NMT model, and that ensembling with an LLM can produce better translations than ensembling two stronger MT models. We combine our method with various techniques from LLM prompting, such as in-context learning and translation context.
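As an illustration of what on-the-fly ensembling can look like, the sketch below mixes the next-token distributions of two models at every decoding step. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the combination rule, so linear interpolation over a shared vocabulary is assumed here, and the names `ensemble_step`, `greedy_decode`, `mt_next`, `llm_next`, and `weight` are hypothetical.

```python
from typing import Callable, Dict, List

# A next-token distribution: token -> probability.
Distribution = Dict[str, float]
# Hypothetical model interface: (source text, target prefix) -> distribution.
NextTokenFn = Callable[[str, List[str]], Distribution]


def ensemble_step(p_mt: Distribution, p_llm: Distribution,
                  weight: float = 0.5) -> Distribution:
    """Linearly interpolate two next-token distributions.

    Assumption: both models share one vocabulary; a token missing
    from one distribution is treated as having probability 0 there.
    """
    vocab = sorted(set(p_mt) | set(p_llm))  # sorted for deterministic ties
    return {
        tok: weight * p_mt.get(tok, 0.0) + (1.0 - weight) * p_llm.get(tok, 0.0)
        for tok in vocab
    }


def greedy_decode(mt_next: NextTokenFn, llm_next: NextTokenFn, source: str,
                  weight: float = 0.5, max_len: int = 20,
                  eos: str = "</s>") -> List[str]:
    """Greedy decoding in which both models score every step together."""
    prefix: List[str] = []
    for _ in range(max_len):
        mixed = ensemble_step(mt_next(source, prefix),
                              llm_next(source, prefix), weight)
        token = max(mixed, key=mixed.get)  # most probable token under the mix
        if token == eos:
            break
        prefix.append(token)
    return prefix
```

In this sketch, `weight` controls how much the MT model contributes relative to the LLM. A real system would typically use beam search rather than greedy decoding and would have to handle the two models' tokenizations, neither of which is covered here.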
May 6, 2024
- Country:
- Asia
- China > Hong Kong (0.04)
- India > Karnataka
- Bengaluru (0.04)
- Japan > Honshū
- Chūbu > Aichi Prefecture > Nagoya (0.04)
- Middle East
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Singapore (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Czechia > Prague (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Finland > Pirkanmaa
- Tampere (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Germany > Berlin (0.05)
- North America
- Canada > Ontario
- Toronto (0.05)
- United States
- Maryland > Baltimore (0.04)
- New York > New York County
- New York City (0.04)
- Washington > King County
- Redmond (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Technology: