Understanding Cross-Lingual Alignment -- A Survey
Hämmerl, Katharina, Libovický, Jindřich, Fraser, Alexander
–arXiv.org Artificial Intelligence
Cross-lingual alignment, the meaningful similarity of representations across languages in multilingual language models, has been an active field of research in recent years. We survey the literature of techniques to improve cross-lingual alignment, providing a taxonomy of methods and summarising insights from throughout the field. We present different understandings of cross-lingual alignment and their limitations. We provide a qualitative summary of results from a large number of surveyed papers. Finally, we discuss how these insights may be applied not only to encoder models, where this topic has been heavily studied, but also to encoder-decoder or even decoder-only models, and argue that an effective trade-off between language-neutral and language-specific information is key.
arXiv.org Artificial Intelligence
Jun-11-2024
- Country:
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Washington > King County
- Canada > Ontario
- Toronto (0.04)
- Europe
- Switzerland (0.04)
- Czechia (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Italy > Tuscany
- Florence (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- France > Île-de-France
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Singapore (0.04)
- China > Hong Kong (0.04)
- Indonesia > Bali (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Genre:
- Research Report (1.00)
- Overview (0.68)
- Technology: