Automatic Input Rewriting Improves Translation with Large Language Models
–arXiv.org Artificial Intelligence
Can we improve machine translation (MT) with LLMs by rewriting their inputs automatically? Users commonly rely on the intuition that well-written text is easier to translate when using off-the-shelf MT systems. LLMs can rewrite text in many ways but in the context of MT, these capabilities have been primarily exploited to rewrite outputs via post-editing. We present an empirical study of 21 input rewriting methods with 3 open-weight LLMs for translating from English into 6 target languages. We show that text simplification is the most effective MT-agnostic rewrite strategy and that it can be improved further when using quality estimation to assess translatability. Human evaluation further confirms that simplified rewrites and their MT outputs both largely preserve the original meaning of the source and MT. These results suggest LLM-assisted input rewriting as a promising direction for improving translations.
arXiv.org Artificial Intelligence
Feb-23-2025
- Country:
- Oceania > Guam (0.04)
- North America
- United States
- Hawaii (0.04)
- South Dakota (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Maryland > Howard County
- Columbia (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Mexico
- Mexico City > Mexico City (0.04)
- Jalisco > Guadalajara (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Austria > Vienna (0.04)
- Spain (0.04)
- France (0.04)
- Italy > Tuscany
- Florence (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Romania > Nord-Est Development Region
- Suceava County > Suceava (0.04)
- Switzerland > Geneva
- Geneva (0.04)
- Asia
- Singapore (0.05)
- Philippines (0.04)
- British Indian Ocean Territory > Diego Garcia (0.04)
- Vietnam > Hồ Chí Minh City
- Hồ Chí Minh City (0.04)
- Thailand
- Middle East
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Saudi Arabia > Asir Province
- Abha (0.04)
- UAE > Abu Dhabi Emirate
- India > Karnataka
- Bengaluru (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: