CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
Liu, Shudong, Jin, Yiqiao, Li, Cheng, Wong, Derek F., Wen, Qingsong, Sun, Lichao, Chen, Haipeng, Xie, Xing, Wang, Jindong
–arXiv.org Artificial Intelligence
Vision-language models (VLMs) have advanced human-AI interaction but struggle with cultural understanding, often misinterpreting symbols, gestures, and artifacts due to biases in predominantly Western-centric training data. In this paper, we construct CultureVerse, a large-scale multimodal benchmark covering 19, 682 cultural concepts, 188 countries/regions, 15 cultural concepts, and 3 question types, with the aim of characterizing and improving VLMs' multicultural understanding capabilities. Then, we propose CultureVLM, a series of VLMs fine-tuned on our dataset to achieve significant performance improvement in cultural understanding. Our evaluation of 16 models reveals significant disparities, with a stronger performance in Western concepts and weaker results in African and Asian contexts. Fine-tuning on our CultureVerse enhances cultural perception, demonstrating cross-cultural, cross-continent, and cross-dataset generalization without sacrificing performance on models' general VLM benchmarks. We further present insights on cultural generalization and forgetting. We hope that this work could lay the foundation for more equitable and culturally aware multimodal AI systems.
arXiv.org Artificial Intelligence
Jan-2-2025
- Country:
- South America
- Oceania
- Vanuatu (0.04)
- Micronesia (0.04)
- Nauru (0.04)
- Papua New Guinea (0.04)
- Tonga (0.04)
- Solomon Islands (0.04)
- Samoa (0.04)
- Tuvalu (0.04)
- Marshall Islands (0.04)
- Fiji (0.04)
- New Zealand (0.04)
- Kiribati (0.04)
- Australia (0.04)
- North America
- Mexico (0.04)
- Trinidad and Tobago (0.04)
- Panama (0.04)
- Costa Rica (0.04)
- Dominican Republic (0.04)
- Antigua and Barbuda (0.04)
- Honduras (0.04)
- Belize (0.04)
- Nicaragua (0.04)
- Guatemala (0.04)
- Dominica (0.04)
- Barbados (0.04)
- Saint Kitts and Nevis (0.04)
- Jamaica (0.04)
- Canada (0.04)
- El Salvador (0.04)
- Saint Lucia (0.04)
- The Bahamas (0.04)
- Haiti (0.04)
- Cuba (0.04)
- Saint Vincent and the Grenadines (0.04)
- United States
- Pennsylvania (0.04)
- District of Columbia > Washington (0.04)
- Minnesota (0.04)
- Arizona (0.04)
- Europe
- Russia (0.14)
- Bulgaria (0.04)
- Spain (0.04)
- Norway (0.04)
- France (0.04)
- Italy (0.04)
- Romania (0.04)
- Ireland (0.04)
- Montenegro (0.04)
- Bosnia and Herzegovina (0.04)
- Austria (0.04)
- Liechtenstein (0.04)
- United Kingdom > Northern Ireland (0.04)
- Andorra (0.04)
- Poland (0.04)
- Germany (0.04)
- Netherlands (0.04)
- Iceland (0.04)
- Switzerland (0.04)
- Denmark (0.04)
- Finland (0.04)
- Albania (0.04)
- Monaco (0.04)
- Slovakia (0.04)
- Slovenia (0.04)
- Serbia (0.04)
- San Marino (0.04)
- Greece (0.04)
- Latvia (0.04)
- Western Europe (0.04)
- Lithuania (0.04)
- Estonia (0.04)
- Belgium (0.04)
- Ukraine (0.04)
- Croatia (0.04)
- Sweden (0.04)
- Czechia (0.04)
- Moldova (0.04)
- Portugal (0.04)
- Hungary (0.04)
- North Macedonia (0.04)
- Belarus (0.04)
- Middle East
- Asia
- Laos (0.14)
- Russia (0.14)
- India (0.04)
- Indonesia (0.04)
- Singapore (0.04)
- Southeast Asia (0.04)
- Sri Lanka (0.04)
- Philippines (0.04)
- Japan (0.04)
- Malaysia (0.04)
- Mongolia (0.04)
- Pakistan (0.04)
- Afghanistan (0.04)
- Tajikistan (0.04)
- Thailand (0.04)
- Turkmenistan (0.04)
- Kyrgyzstan (0.04)
- Myanmar (0.04)
- Taiwan (0.04)
- South Korea (0.04)
- Armenia (0.04)
- Timor-Leste (0.04)
- Maldives (0.04)
- Vietnam (0.04)
- Azerbaijan (0.04)
- Bhutan (0.04)
- Bangladesh (0.04)
- Uzbekistan (0.04)
- East Asia (0.04)
- Kazakhstan (0.04)
- Macao (0.04)
- Nepal (0.04)
- Cambodia (0.04)
- China
- Beijing > Beijing (0.04)
- Jiangsu Province > Nanjing (0.04)
- Middle East
- Africa
- Ethiopia (0.04)
- Rwanda (0.04)
- Nigeria (0.04)
- Kenya (0.04)
- Côte d'Ivoire (0.04)
- Zambia (0.04)
- Benin (0.04)
- Comoros (0.04)
- Mauritius (0.04)
- Democratic Republic of the Congo (0.04)
- Burundi (0.04)
- Cameroon (0.04)
- Senegal (0.04)
- Angola (0.04)
- Liberia (0.04)
- Malawi (0.04)
- Madagascar (0.04)
- Equatorial Guinea (0.04)
- South Sudan (0.04)
- Tanzania (0.04)
- Gabon (0.04)
- Uganda (0.04)
- Mozambique (0.04)
- Mauritania (0.04)
- Seychelles (0.04)
- Eritrea (0.04)
- Zimbabwe (0.04)
- Namibia (0.04)
- Eswatini (0.04)
- Ghana (0.04)
- Central African Republic (0.04)
- The Gambia (0.04)
- Botswana (0.04)
- South Africa (0.04)
- Sudan (0.04)
- Niger (0.04)
- Burkina Faso (0.04)
- Sierra Leone (0.04)
- Mali (0.04)
- Lesotho (0.04)
- Togo (0.04)
- Middle East
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Technology: