CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Villa-Cueva, Emilio, Bolatzhanova, Sholpan, Turmakhan, Diana, Elzeky, Kareem, Ademtew, Henok Biadglign, Aji, Alham Fikri, Araujo, Vladimir, Azime, Israel Abebe, Baek, Jinheon, Belcavello, Frederico, Cristobal, Fermin, Cruz, Jan Christian Blaise, Dabre, Mary, Dabre, Raj, Ehsan, Toqeer, Etori, Naome A, Farooqui, Fauzan, Geng, Jiahui, Ivetta, Guido, Jayakumar, Thanmay, Jeong, Soyeong, Lim, Zheng Wei, Mandal, Aishik, Martinelli, Sofia, Mihaylov, Mihail Minkov, Orel, Daniil, Pramanick, Aniket, Purkayastha, Sukannya, Salazar, Israfel, Song, Haiyue, Torrent, Tiago Timponi, Yadeta, Debela Desalegn, Hamed, Injy, Tonja, Atnafu Lambebo, Solorio, Thamar
–arXiv.org Artificial Intelligence
Translating cultural content poses challenges for machine translation systems due to the differences in conceptualizations between cultures, where language alone may fail to convey sufficient context to capture region-specific meanings. In this work, we investigate whether images can act as cultural context in multimodal translation. We introduce CaMMT, a human-curated benchmark of over 5,800 triples of images along with parallel captions in English and regional languages. Using this dataset, we evaluate five Vision Language Models (VLMs) in text-only and text+image settings. Through automatic and human evaluations, we find that visual context generally improves translation quality, especially in handling Culturally-Specific Items (CSIs), disambiguation, and correct gender marking. By releasing CaMMT, our objective is to support broader efforts to build and evaluate multimodal translation systems that are better aligned with cultural nuance and regional variations.
arXiv.org Artificial Intelligence
Sep-23-2025
- Country:
- Africa
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Kenya (0.04)
- Middle East > Egypt (0.04)
- Nigeria (0.04)
- Ethiopia > Addis Ababa
- Asia
- Pakistan (0.04)
- Middle East > Israel (0.04)
- Malaysia (0.04)
- Japan (0.04)
- Indonesia (0.04)
- Philippines (0.04)
- Russia (0.04)
- China (0.04)
- South Korea (0.04)
- India (0.04)
- Europe
- Belgium (0.04)
- Bulgaria (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany
- Hesse > Darmstadt Region
- Darmstadt (0.04)
- Saarland (0.04)
- Hesse > Darmstadt Region
- Russia (0.04)
- North America
- Mexico > Jalisco (0.04)
- United States > Minnesota (0.04)
- South America
- Africa
- Genre:
- Research Report > New Finding (1.00)
- Technology: