The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation
–arXiv.org Artificial Intelligence
Prior research diverges on language diversity in LLM fine-tuning: Some studies report benefits while others find no advantages. Through controlled fine-tuning experiments across 132 translation directions, we systematically resolve these disparities. We find that expanding language diversity during fine-tuning improves translation quality for both unsupervised and -- surprisingly -- supervised pairs, despite less diverse models being fine-tuned exclusively on these supervised pairs. However, benefits plateau or decrease beyond a certain diversity threshold. We show that increased language diversity creates more language-agnostic representations. These representational adaptations help explain the improved performance in models fine-tuned with greater diversity.
arXiv.org Artificial Intelligence
Sep-22-2025
- Country:
- Europe (0.68)
- North America > United States
- Minnesota (0.28)
- Asia > Middle East
- UAE (0.46)
- Genre:
- Research Report > New Finding (0.94)
- Technology: