Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Alabi, Jesujoba O., Hedderich, Michael A., Adelani, David Ifeoluwa, Klakow, Dietrich
–arXiv.org Artificial Intelligence
With over 2,000 languages and potentially millions of speakers, Africa represents one of the richest linguistic regions in the world. Yet, this diversity is scarcely reflected in state-of-the-art natural language processing (NLP) systems and large language models (LLMs), which predominantly support a narrow set of high-resource languages. This exclusion not only limits the reach and utility of modern NLP technologies but also risks widening the digital divide across linguistic communities. Nevertheless, NLP research on African languages is active and growing. In recent years, there has been a surge of interest in this area, driven by several factors-including the creation of multilingual language resources, the rise of community-led initiatives, and increased support through funding programs. In this survey, we analyze 884 research papers on NLP for African languages published over the past five years, offering a comprehensive overview of recent progress across core tasks. We identify key trends shaping the field and conclude by outlining promising directions to foster more inclusive and sustainable NLP research for African languages.
arXiv.org Artificial Intelligence
Oct-3-2025
- Country:
- Asia (1.00)
- Africa (1.00)
- Europe > Spain (0.67)
- North America > United States
- Minnesota (0.27)
- Genre:
- Overview (1.00)
- Industry:
- Health & Medicine (1.00)
- Education (0.68)
- Technology:
- Information Technology > Artificial Intelligence
- Speech > Speech Recognition (0.93)
- Natural Language
- Text Processing (1.00)
- Machine Translation (1.00)
- Large Language Model (1.00)
- Chatbot (0.93)
- Grammars & Parsing (0.93)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology > Artificial Intelligence