Rule-Based, Neural and LLM Back-Translation: Comparative Insights from a Variant of Ladin
Frontull, Samuel; Moser, Georg
arXiv.org Artificial Intelligence
This paper explores the impact of different back-translation approaches on machine translation for Ladin, specifically the Val Badia variant. Given the limited amount of parallel data available for this language (only 18k Ladin-Italian sentence pairs), we investigate the performance of a multilingual neural machine translation model fine-tuned for Ladin-Italian. In addition to the available authentic data, we synthesise further translations by using three different models: a fine-tuned neural model, a rule-based system developed specifically for this language pair, and a large language model. Our experiments show that all approaches achieve comparable translation quality in this low-resource scenario, yet round-trip translations highlight differences in model performance.
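The two ideas named in the abstract, back-translation and round-trip translation, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the `Translator` type, the function names, and the token-overlap score are hypothetical, and the toy translators (`str.upper` and `str.lower`) merely stand in for the fine-tuned neural model, the rule-based system, or the large language model.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: any system that maps a sentence to its translation.
Translator = Callable[[str], str]


def back_translate(
    monolingual_target: List[str], target_to_source: Translator
) -> List[Tuple[str, str]]:
    """Pair each authentic target sentence with a synthetic source sentence.

    The returned (synthetic_source, authentic_target) pairs can be added to
    the authentic parallel data when training a source-to-target model.
    """
    return [(target_to_source(t), t) for t in monolingual_target]


def round_trip(sentence: str, forward: Translator, backward: Translator) -> str:
    """Translate a sentence into the other language and back again."""
    return backward(forward(sentence))


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap of whitespace tokens; an assumed, deliberately crude
    stand-in for a real translation metric."""
    ta, tb = set(a.split()), set(b.split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


# Toy stand-ins for real translation systems (assumption for illustration).
toy_forward: Translator = str.upper
toy_backward: Translator = str.lower

synthetic_pairs = back_translate(["hello world"], toy_forward)
restored = round_trip("hello world", toy_forward, toy_backward)
score = token_overlap("hello world", restored)
```

Comparing a sentence with its round-trip result gives a rough signal of how much information a pair of models preserves, which is the kind of comparison the abstract refers to when it says round-trip translations highlight differences between models.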
Jul-11-2024
- Country:
  - Oceania > Australia (0.04)
  - North America
    - United States
      - New York > New York County
        - New York City (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
    - Canada > Ontario
      - Toronto (0.04)
  - Europe
    - Germany > Berlin (0.04)
    - Latvia > Riga Municipality
      - Riga (0.04)
    - Iceland > Capital Region
      - Reykjavik (0.04)
    - Finland
    - Austria > Tyrol
      - Innsbruck (0.04)
    - Portugal > Lisbon
      - Lisbon (0.14)
    - Italy
      - Tuscany > Florence (0.04)
      - Trentino-Alto Adige/Südtirol > South Tyrol (0.04)
    - Sweden > Östergötland County
      - Linköping (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Belgium > Brussels-Capital Region
      - Brussels (0.04)
  - Asia
    - Singapore (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
- Genre:
  - Research Report > New Finding (0.93)
- Technology: