kenya
- North America > Canada > Quebec > Montreal (0.14)
- Africa > Kenya (0.07)
- North America > United States > California (0.05)
- (12 more...)
- Health & Medicine (0.68)
- Energy > Renewable (0.31)
The Download: Kenya's Great Carbon Valley, and the AI terms that were everywhere in 2025
The Download: Kenya's Great Carbon Valley, and the AI terms that were everywhere in 2025 Welcome to Kenya's Great Carbon Valley: a bold new gamble to fight climate change In June last year, startup Octavia Carbon began running a high-stakes test in the small town of Gilgil in south-central Kenya. It's harnessing some of the excess energy generated by vast clouds of steam under the Earth's surface to power prototypes of a machine that promises to remove carbon dioxide from the air in a manner that the company says is efficient, affordable, and--crucially--scalable. The company's long-term vision is undoubtedly ambitious--it wants to prove that direct air capture (DAC), as the process is known, can be a powerful tool to help the world keep temperatures from rising to ever more dangerous levels. But DAC is also a controversial technology, unproven at scale and wildly expensive to operate. On top of that, Kenya's Maasai people have plenty of reasons to distrust energy companies. This article is also part of the Big Story series: 's most important, ambitious reporting.
- Africa > Kenya > Central Province (0.25)
- Asia > China (0.07)
- South America > Venezuela (0.05)
- (3 more...)
- Health & Medicine (1.00)
- Energy (0.90)
- Media (0.71)
Edge-Based Speech Transcription and Synthesis for Kinyarwanda and Swahili Languages
Mbonimpa, Pacome Simon, Tuyizere, Diane, Biyabani, Azizuddin Ahmed, Tonguz, Ozan K.
Abstract--This paper presents a novel framework for speech transcription and synthesis, leveraging edge-cloud parallelism to enhance processing speed and accessibility for Kinyarwanda and Swahili speakers. It addresses the scarcity of powerful language processing tools for these widely spoken languages in East African countries with limited technological infrastructure. The framework utilizes the Whisper and SpeechT5 pre-trained models to enable speech-to-text (STT) and text-to-speech (TTS) translation. The architecture uses a cascading mechanism that distributes the model inference workload between the edge device and the cloud, thereby reducing latency and resource usage, benefiting both ends. On the edge device, our approach achieves a memory usage compression of 9.5% for the SpeechT5 model and 14% for the Whisper model, with a maximum memory usage of 149 MB. Experimental results indicate that on a 1.7 GHz CPU edge device with a 1 MB/s network bandwidth, the system can process a 270-character text in less than a minute for both speech-to-text and text-to-speech transcription. Using real-world survey data from Kenya, it is shown that the cascaded edge-cloud architecture proposed could easily serve as an excellent platform for STT and TTS transcription with good accuracy and response time. I. INTRODUCTION In today's digital age, the need for accurate and efficient speech transcription and synthesis models has been increasing rapidly. These models play an important role in a variety of applications, such as learning new language(s), accessibility tools for people with difficulties in reading and hearing, as well as automated voice assistants [1]. Kinyarwanda and Swahili are two of the local languages spoken in East Africa. While Swahili is the most widely spoken language in Eastern Africa, the speakers range from 60 million to over 150 million [2].
- Africa > East Africa (0.54)
- Africa > Kenya (0.26)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- (5 more...)
This overlooked human ancestor wielded tools with a gorilla-like grip
Researchers in Kenya excavated the first hand and foot bones belonging to'Paranthropis boisei.' Breakthroughs, discoveries, and DIY tips sent every weekday. When it comes to our evolutionary cousins, Neanderthals get most of the attention. Part of this is understandable, since there was a time in Earth's history when the role of dominant primate was up for grabs . Tweak any number of environmental factors, and the tool-wielding, yarn-weaving may have outlasted its cousins (aka humans) instead of going extinct around 40,000 years ago.
- North America > Canada > Quebec > Montreal (0.14)
- Africa > Kenya (0.07)
- North America > United States > California (0.05)
- (12 more...)
The Download: our thawing permafrost, and a drone-filled future
Plus: America's first AI law is here Scientists can see Earth's permafrost thawing from space Something is rotten in the city of Nunapitchuk. In recent years, sewage has leached into the earth. The ground can feel squishy, sodden. This small town in northern Alaska is experiencing a sometimes overlooked consequence of climate change: thawing permafrost. And Nunapitchuk is far from the only Arctic town to find itself in such a predicament. Now scientists think they may be able to use satellite data to delve deep beneath the ground's surface and get a better understanding of how the permafrost thaws, and which areas might be most severely affected.
- North America > United States > Alaska (0.25)
- Asia > South Korea (0.15)
- Africa > Kenya (0.06)
- (5 more...)
Retrieval-Augmented Clinical Benchmarking for Contextual Model Testing in Kenyan Primary Care: A Methodology Paper
Mutisya, Fred, Gitau, Shikoh, Syovata, Christine, Oigara, Diana, Matende, Ibrahim, Aden, Muna, Ali, Munira, Nyotu, Ryan, Marion, Diana, Nyangena, Job, Ongoma, Nasubo, Mbae, Keith, Wamicha, Elizabeth, Mibuari, Eric, Nsengemana, Jean Philbert, Chidede, Talkmore
Large Language Models (LLMs) hold promise for improving healthcare access in low-resource settings, but their effectiveness in African primary care contexts remains under-explored. We present a rigorous methodology for creating a benchmark dataset and evaluation framework focused on Kenyan Level 2-3 (dispensary and health center) clinical care. Our approach leverages retrieval-augmented generation (RAG) to ground questions and answers in Kenya's national clinical guidelines, ensuring content aligns with local standard-of-care. The guidelines were digitised, chunked, and indexed for efficient semantic retrieval. Gemini Flash 2.0 Lite was then prompted with relevant guideline excerpts to generate realistic clinical questions, multiple - choice answers, and reasoning scenarios with source citations in English and Swahili. We engaged Kenyan physicians in a co - creation process to refine the dataset's relevance and fairness, and instituted a blinded expert validation pipeline to review for clinical accuracy, clarity, and cultural appropriateness. The resulting Alama Health QA dataset comprises thousands of regulator-aligned question-answer pairs spanning common outpatient conditions in English and Swahili. Beyond standard accuracy metrics, we propose innovative evaluation measures targeting clinical reasoning, safety, and adaptability (e.g. Initial results highlight significant performance gaps in state - of-the - art LLMs when confronted with localized scenarios, echoing recent findings that LLM accuracy on African medical questions lags behind performance on U.S. benchmarks. Our work demonstrates a pathway for dynamic, locally-grounded benchmarks that can evolve with guidelines, providing a crucial tool for safe and effective deployment of AI in African healthcare. Advances in large language models have spurred interest in their potential to augment medical services, especially in low-and middle -income countries facing clinician shortages(Bekbolatova et al., 2024). By handling routine queries or providing decision support, LLMs might help bridge gaps in healthcare access across Africa.
- Africa > South Africa > Gauteng > Johannesburg (0.04)
- Africa > Kenya > Kisumu County > Kisumu (0.04)
- Europe > United Kingdom (0.04)
- (3 more...)
Dukawalla: Voice Interfaces for Small Businesses in Africa
Ankrah, Elizabeth, Nyairo, Stephanie, Muchai, Mercy, Awori, Kagonya, Ochieng, Millicent, Kariuki, Mark, O'Neill, Jacki
Small and medium sized businesses often struggle with data driven decision making do to a lack of advanced analytics tools, especially in African countries where they make up a majority of the workforce. Though many tools exist they are not designed to fit into the ways of working of SMB workers who are mobile first, have limited time to learn new workflows, and for whom social and business are tightly coupled. To address this, the Dukawalla prototype was created. This intelligent assistant bridges the gap between raw business data, and actionable insights by leveraging voice interaction and the power of generative AI. Dukawalla provides an intuitive way for business owners to interact with their data, aiding in informed decision making. This paper examines Dukawalla's deployment across SMBs in Nairobi, focusing on their experiences using this voice based assistant to streamline data collection and provide business insights
- Africa > Kenya > Nairobi City County > Nairobi (0.26)
- North America > United States > California > Orange County > Irvine (0.05)
- Africa > Kenya > Nairobi Province (0.05)
- North America > United States > New York > New York County > New York City (0.04)
RideKE: Leveraging Low-Resource, User-Generated Twitter Content for Sentiment and Emotion Detection in Kenyan Code-Switched Dataset
Etori, Naome A., Gini, Maria L.
Social media has become a crucial open-access platform for individuals to express opinions and share experiences. However, leveraging low-resource language data from Twitter is challenging due to scarce, poor-quality content and the major variations in language use, such as slang and code-switching. Identifying tweets in these languages can be difficult as Twitter primarily supports high-resource languages. We analyze Kenyan code-switched data and evaluate four state-of-the-art (SOTA) transformer-based pretrained models for sentiment and emotion classification, using supervised and semi-supervised methods. We detail the methodology behind data collection and annotation, and the challenges encountered during the data curation phase. Our results show that XLM-R outperforms other models; for sentiment analysis, XLM-R supervised model achieves the highest accuracy (69.2\%) and F1 score (66.1\%), XLM-R semi-supervised (67.2\% accuracy, 64.1\% F1 score). In emotion analysis, DistilBERT supervised leads in accuracy (59.8\%) and F1 score (31\%), mBERT semi-supervised (accuracy (59\% and F1 score 26.5\%). AfriBERTa models show the lowest accuracy and F1 scores. All models tend to predict neutral sentiment, with Afri-BERT showing the highest bias and unique sensitivity to empathy emotion. https://github.com/NEtori21/Ride_hailing
- Africa > Kenya > Nairobi City County > Nairobi (0.07)
- Africa > Kenya > Nairobi Province (0.06)
- Africa > Kenya > Mombasa County > Mombasa (0.05)
- (18 more...)
- Transportation > Passenger (1.00)
- Information Technology (1.00)
- Transportation > Ground > Road (0.93)
Building low-resource African language corpora: A case study of Kidawida, Kalenjin and Dholuo
Mbogho, Audrey, Awuor, Quin, Kipkebut, Andrew, Wanzare, Lilian, Oloo, Vivian
Natural Language Processing is a crucial frontier in artificial intelligence, with broad applications in many areas, including public health, agriculture, education, and commerce. However, due to the lack of substantial linguistic resources, many African languages remain underrepresented in this digital transformation. This paper presents a case study on the development of linguistic corpora for three under-resourced Kenyan languages, Kidaw'ida, Kalenjin, and Dholuo, with the aim of advancing natural language processing and linguistic research in African communities. Our project, which lasted one year, employed a selective crowd-sourcing methodology to collect text and speech data from native speakers of these languages. Data collection involved (1) recording conversations and translation of the resulting text into Kiswahili, thereby creating parallel corpora, and (2) reading and recording written texts to generate speech corpora. We made these resources freely accessible via open-research platforms, namely Zenodo for the parallel text corpora and Mozilla Common Voice for the speech datasets, thus facilitating ongoing contributions and access for developers to train models and develop Natural Language Processing applications. The project demonstrates how grassroots efforts in corpus building can support the inclusion of African languages in artificial intelligence innovations. In addition to filling resource gaps, these corpora are vital in promoting linguistic diversity and empowering local communities by enabling Natural Language Processing applications tailored to their needs. As African countries like Kenya increasingly embrace digital transformation, developing indigenous language resources becomes essential for inclusive growth. We encourage continued collaboration from native speakers and developers to expand and utilize these corpora.
- Africa > South Sudan (0.14)
- Africa > Uganda (0.05)
- North America > United States (0.04)
- (17 more...)
- Health & Medicine (0.67)
- Media > News (0.46)