The Multilingual Divide and Its Impact on Global AI Safety

Peppin, Aidan, Kreutzer, Julia, Sebag, Alice Schoenauer, Marchisio, Kelly, Ermis, Beyza, Dang, John, Cahyawijaya, Samuel, Singh, Shivalika, Goldfarb-Tarrant, Seraphina, Aryabumi, Viraat, Aakanksha, null, Ko, Wei-Yin, Üstün, Ahmet, Gallé, Matthias, Fadaee, Marzieh, Hooker, Sara

arXiv.org Artificial Intelligence 

Despite advances in large language model capabilities in recent years, a large gap remains in their capabilities and safety performance for many languages beyond a relatively small handful of globally dominant languages. This paper provides researchers, policymakers and governance experts with an overview of key challenges to bridging the "language gap" in AI and minimizing safety risks across languages. We provide an analysis of why the language gap in AI exists and grows, and how it creates disparities in global AI safety. We identify barriers to address these challenges, and recommend how those working in policy and governance can help address safety concerns associated with the language gap by supporting multilingual dataset creation, transparency, and research.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found