Grounding Multilingual Multimodal LLMs With Cultural Knowledge
Nyandwi, Jean de Dieu, Song, Yueqi, Khanuja, Simran, Neubig, Graham
–arXiv.org Artificial Intelligence
Multimodal Large Language Models excel in high-resource settings, but often misinterpret long-tail cultural entities and underperform in low-resource languages. To address this gap, we propose a data-centric approach that directly grounds MLLMs in cultural knowledge. Leveraging a large scale knowledge graph from Wikidata, we collect images that represent culturally significant entities, and generate synthetic multilingual visual question answering data. The resulting dataset, CulturalGround, comprises 22 million high-quality, culturally-rich VQA pairs spanning 42 countries and 39 languages. We train an open-source MLLM CulturalPangea on CulturalGround, interleaving standard multilingual instruction-tuning data to preserve general abilities. CulturalPangea achieves state-of-the-art performance among open models on various culture-focused multilingual multimodal benchmarks, outperforming prior models by an average of 5.0 without degrading results on mainstream vision-language tasks. Our findings show that our targeted, culturally grounded approach could substantially narrow the cultural gap in MLLMs and offer a practical path towards globally inclusive multimodal systems.
arXiv.org Artificial Intelligence
Aug-13-2025
- Country:
- Africa
- Asia
- Pakistan (0.04)
- Mongolia (0.04)
- Malaysia (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Nagasaki Prefecture > Nagasaki (0.04)
- Bangladesh (0.04)
- Vietnam (0.04)
- Indonesia (0.04)
- Middle East
- Iran (0.04)
- Israel (0.04)
- Republic of Türkiye (0.04)
- Saudi Arabia (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Russia (0.04)
- China (0.04)
- South Korea (0.04)
- Taiwan (0.04)
- Sri Lanka (0.04)
- Southeast Asia (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Singapore (0.04)
- India (0.15)
- Europe
- Portugal (0.04)
- United Kingdom
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Czechia (0.04)
- Ukraine (0.04)
- Romania (0.04)
- Greece (0.04)
- Russia (0.04)
- Italy (0.04)
- France (0.04)
- Norway (0.04)
- Netherlands (0.04)
- Spain (0.14)
- Germany (0.04)
- Poland (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Bulgaria > Plovdiv Province
- Plovdiv (0.04)
- North America
- Mexico (0.04)
- United States > Pennsylvania
- Allegheny County > Pittsburgh (0.04)
- South America > Brazil (0.04)
- Genre:
- Research Report > New Finding (0.86)
- Industry:
- Education (0.68)
- Government (0.93)
- Leisure & Entertainment (1.00)
- Media > Music (0.67)
- Technology: