Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Bawden, Rachel, Yvon, François
–arXiv.org Artificial Intelligence
The NLP community recently saw the release of a new large open-access multilingual language model, BLOOM (BigScience et al., 2022) covering 46 languages. We focus on BLOOM's multilingual ability by evaluating its machine translation performance across several datasets (WMT, Flores-101 and DiaBLa) and language pairs (high- and low-resourced). Our results show that 0-shot performance suffers from overgeneration and generating in the wrong language, but this is greatly improved in the few-shot setting, with very good results for a number of language pairs. We study several aspects including prompt design, model sizes, cross-lingual transfer and the use of discursive context.
arXiv.org Artificial Intelligence
May-9-2023
- Country:
- Oceania > Australia
- North America
- United States
- Pennsylvania (0.04)
- Maryland > Baltimore (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > San Diego County
- San Diego (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > British Columbia
- United States
- Europe
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Île-de-France
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > Scotland
- Asia
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- China
- Middle East
- Genre:
- Research Report > New Finding (1.00)
- Technology: