Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore
Shafayat, Sheikh, Kim, Eunsu, Oh, Juhyun, Oh, Alice
arXiv.org Artificial Intelligence
Large Language Models (LLMs) are prone to factuality hallucination, generating text that contradicts established knowledge. While extensive research has addressed this in English, little is known about multilingual LLMs. This paper systematically evaluates multilingual LLMs' factual accuracy across languages and geographic regions. We introduce a novel pipeline for multilingual factuality evaluation, adapting FActScore (Min et al., 2023) for diverse languages. Our analysis across nine languages reveals that English consistently outperforms the other languages in both factual accuracy and the quantity of generated facts. Furthermore, multilingual models demonstrate a bias towards factual information from Western continents. These findings highlight the need for improved multilingual factuality assessment and underscore geographical biases in LLMs' fact generation.
Mar-1-2024
- Country:
  - South America
  - Oceania
    - Australia (0.14)
    - New Zealand (0.05)
  - North America
    - Canada (0.14)
    - Central America (0.05)
    - Dominican Republic (0.04)
    - Honduras (0.04)
    - Nicaragua (0.04)
    - Guatemala (0.04)
    - Mexico (0.04)
    - Haiti (0.04)
    - Cuba (0.04)
    - United States > Louisiana
      - Orleans Parish > New Orleans (0.04)
  - Europe
    - Romania (0.14)
    - United Kingdom (0.14)
    - Eastern Europe (0.05)
    - Western Europe (0.05)
    - Northern Europe (0.04)
    - Austria (0.04)
    - Bulgaria (0.04)
    - Poland (0.04)
    - Germany (0.04)
    - Spain (0.04)
    - Netherlands (0.04)
    - Switzerland (0.04)
    - Italy (0.04)
    - Russia (0.04)
    - Greece (0.04)
    - Belgium (0.04)
    - Ukraine (0.04)
    - Sweden (0.04)
    - Czechia (0.04)
    - Portugal (0.04)
    - Hungary (0.04)
    - Belarus (0.04)
    - France > Auvergne-Rhône-Alpes
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
  - Asia
    - India (0.14)
    - China (0.14)
    - Russia (0.14)
    - Pakistan (0.14)
    - Singapore (0.04)
    - Uzbekistan (0.04)
    - Malaysia (0.04)
    - Indonesia > Bali (0.04)
    - Afghanistan (0.04)
    - Thailand (0.04)
    - Myanmar (0.04)
    - Philippines (0.04)
    - Japan (0.04)
    - Vietnam (0.04)
    - Bangladesh (0.04)
    - East Asia (0.04)
    - Central Asia (0.04)
    - Nepal (0.04)
    - South Korea
    - Middle East
      - Republic of Türkiye (0.14)
      - Iran (0.14)
      - Saudi Arabia (0.04)
      - Yemen (0.04)
      - Iraq (0.04)
  - Africa
    - Democratic Republic of the Congo (0.14)
    - East Africa (0.05)
    - West Africa (0.05)
    - North Africa (0.05)
    - South Africa (0.04)
    - Côte d'Ivoire (0.04)
    - Cameroon (0.04)
    - Angola (0.04)
    - Madagascar (0.04)
    - Tanzania (0.04)
    - Uganda (0.04)
    - Mozambique (0.04)
    - Ghana (0.04)
    - Nigeria (0.04)
    - Kenya (0.04)
    - Sudan (0.04)
    - Niger (0.04)
    - Burkina Faso (0.04)
    - Southern Africa (0.04)
    - Mali (0.04)
    - Ethiopia > Addis Ababa
      - Addis Ababa (0.04)
    - Middle East
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Government > Regional Government > Asia Government (1.00)
- Technology: