Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Alabi, Jesujoba O., Hedderich, Michael A., Adelani, David Ifeoluwa, Klakow, Dietrich
–arXiv.org Artificial Intelligence
With over 2,000 languages and potentially millions of speakers, Africa represents one of the richest linguistic regions in the world. Yet, this diversity is scarcely reflected in state-of-the-art natural language processing (NLP) systems and large language models (LLMs), which predominantly support a narrow set of high-resource languages. This exclusion not only limits the reach and utility of modern NLP technologies but also risks widening the digital divide across linguistic communities. Nevertheless, NLP research on African languages is active and growing. In recent years, there has been a surge of interest in this area, driven by several factors-including the creation of multilingual language resources, the rise of community-led initiatives, and increased support through funding programs. In this survey, we analyze 884 research papers on NLP for African languages published over the past five years, offering a comprehensive overview of recent progress across core tasks. We identify key trends shaping the field and conclude by outlining promising directions to foster more inclusive and sustainable NLP research for African languages.
arXiv.org Artificial Intelligence
Oct-3-2025
- Country:
- Africa
- Southern Africa (0.04)
- Ethiopia (0.04)
- Niger (0.04)
- Kenya (0.04)
- South Africa (0.04)
- Ghana (0.04)
- Rwanda (0.04)
- Mozambique (0.04)
- Uganda (0.04)
- East Africa (0.04)
- Tanzania (0.04)
- Senegal (0.04)
- Burundi (0.04)
- Nigeria > Jigawa State
- Dutse (0.04)
- Middle East > Somalia (0.04)
- Asia
- China > Hong Kong (0.04)
- Indonesia > Bali (0.04)
- Middle East
- Israel (0.04)
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.05)
- South Korea > Gyeonggi-do
- Suwon (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Middle East
- Cyprus > Nicosia
- Nicosia (0.04)
- Malta > Eastern Region
- Northern Harbour District > St. Julian's (0.04)
- Cyprus > Nicosia
- Italy
- Piedmont > Turin Province
- Turin (0.04)
- Tuscany > Florence (0.04)
- Piedmont > Turin Province
- Portugal > Lisbon
- Lisbon (0.04)
- Spain
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.05)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- Saarland (0.04)
- Bavaria > Upper Bavaria
- Austria > Vienna (0.14)
- Ireland > Leinster
- North America
- Canada
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Florida > Miami-Dade County
- Miami (0.05)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- Florida > Miami-Dade County
- South America > Chile
- Africa
- Genre:
- Overview (1.00)
- Industry:
- Education (0.68)
- Health & Medicine (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Natural Language
- Chatbot (0.93)
- Grammars & Parsing (0.93)
- Large Language Model (1.00)
- Machine Translation (1.00)
- Text Processing (1.00)
- Speech > Speech Recognition (0.93)
- Machine Learning > Neural Networks
- Information Technology > Artificial Intelligence