AFRICAPTION: Establishing a New Paradigm for Image Captioning in African Languages
Oduwole, Mardiyyah, Mireku, Prince, Adebanjo, Fatimo, Olajide, Oluwatosin, Aliyu, Mahi Aminu, Novikova, Jekaterina
–arXiv.org Artificial Intelligence
Multimodal AI research has overwhelmingly focused on high-resource languages, hindering the democratization of advancements in the field. To address this, we present AfriCaption, a comprehensive framework for multilingual image captioning in 20 African languages and our contributions are threefold: (i) a curated dataset built on Flickr8k, featuring semantically aligned captions generated via a context-aware selection and translation process; (ii) a dynamic, context-preserving pipeline that ensures ongoing quality through model ensembling and adaptive substitution; and (iii) the AfriCaption model, a 0.5B parameter vision-to-text architecture that integrates SigLIP and NLLB200 for caption generation across under-represented languages. This unified framework ensures ongoing data quality and establishes the first scalable image-captioning resource for under-represented African languages, laying the groundwork for truly inclusive multimodal AI.
arXiv.org Artificial Intelligence
Oct-21-2025
- Country:
- Africa
- Togo (0.04)
- Mali (0.04)
- Burkina Faso (0.04)
- Ethiopia (0.04)
- Niger (0.05)
- Middle East > Algeria (0.04)
- Kenya (0.04)
- South Africa (0.04)
- Nigeria (0.05)
- Ghana (0.04)
- Namibia (0.04)
- Rwanda (0.04)
- Uganda (0.04)
- South Sudan (0.04)
- Angola (0.05)
- Senegal (0.04)
- Cameroon (0.04)
- Democratic Republic of the Congo (0.05)
- Benin (0.04)
- Chad (0.04)
- Zambia (0.04)
- Côte d'Ivoire (0.04)
- Europe > Portugal
- North America > United States
- New York > New York County
- New York City (0.04)
- Texas > Dallas County
- Dallas (0.04)
- New York > New York County
- Africa
- Genre:
- Research Report (0.82)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language > Machine Translation (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence