10 Best African Language Datasets for Data Science Projects
Africa has over 2000 languages, but these languages are not well-represented in the existing Natural Language Processing ecosystem. One challenge is the lack of useful African language datasets that we can use to solve different social and economic problems. In this article, I have compiled a list of African language datasets from across the web. You can use these datasets in various NLP tasks such as text classification, named entity recognition, machine translation, sentiment analysis, speech recognition, and topic modeling. I've made this collection of datasets public to give you an opportunity to use your skills and help solve different challenges.
Jun-27-2021, 21:40:05 GMT
- Country:
- Africa
- Senegal (0.06)
- South Africa (0.06)
- Malawi (0.05)
- East Africa (0.05)
- Rwanda (0.05)
- Niger (0.05)
- Zambia (0.05)
- Benin (0.05)
- Democratic Republic of the Congo (0.05)
- Tanzania (0.05)
- Mauritania (0.05)
- Ghana (0.05)
- Nigeria (0.05)
- The Gambia (0.05)
- West Africa (0.05)
- Southern Africa (0.05)
- Togo (0.05)
- Uganda > Central Region
- Kampala (0.05)
- Africa
- Technology: