MURAL: Multimodal, Multitask Retrieval Across Languages
Jain, Aashi, Guo, Mandy, Srinivasan, Krishna, Chen, Ting, Kudugunta, Sneha, Jia, Chao, Yang, Yinfei, Baldridge, Jason
–arXiv.org Artificial Intelligence
Both image-caption pairs and translation pairs provide the means to learn deep representations of and connections between languages. We use both types of pairs in MURAL (MUltimodal, MUltitask Representations Across Languages), a dual encoder that solves two tasks: 1) image-text matching and 2) translation pair matching. By incorporating billions of translation pairs, MURAL extends ALIGN (Jia et al. PMLR'21)--a state-of-the-art dual encoder learned from 1.8 billion noisy image-text pairs. When using the same encoders, MURAL's performance matches or exceeds ALIGN's cross-modal retrieval performance on well-resourced languages across several datasets. More importantly, it considerably improves performance on under-resourced languages, showing that text-text learning can overcome a paucity of image-caption examples for these languages. On the Wikipedia Image-Text dataset, for example, MURAL-base improves zero-shot mean recall by 8.1% on average for eight under-resourced languages and by 6.8% on average when fine-tuning. We additionally show that MURAL's text representations cluster not only with respect to genealogical connections but also based on areal linguistics, such as the Balkan Sprachbund.
arXiv.org Artificial Intelligence
Sep-10-2021
- Country:
- Oceania > Australia
- North America
- Europe
- Slovenia (0.04)
- Germany > Berlin (0.04)
- Belgium (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- China > Hong Kong (0.04)
- Macao (0.04)
- India (0.04)
- Thailand > Phuket
- Phuket (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Japan
- Kyūshū & Okinawa > Kyūshū
- Miyazaki Prefecture > Miyazaki (0.04)
- Honshū
- Kansai > Kyoto Prefecture
- Kyoto (0.04)
- Chūbu > Aichi Prefecture
- Nagoya (0.04)
- Kansai > Kyoto Prefecture
- Kyūshū & Okinawa > Kyūshū
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Technology: