AITopics | Western Region

Collaborating Authors

Western Region

Data Augmentation With Back translation for Low Resource languages: A case of English and Luganda

Kimera, Richard, Heo, Dongnyeong, Rim, Daniela N., Choi, Heeyoul

arXiv.org Artificial IntelligenceMay-6-2025

In this paper,we explore the application of Back translation (BT) as a semi-supervised technique to enhance Neural Machine Translation(NMT) models for the English-Luganda language pair, specifically addressing the challenges faced by low-resource languages. The purpose of our study is to demonstrate how BT can mitigate the scarcity of bilingual data by generating synthetic data from monolingual corpora. Our methodology involves developing custom NMT models using both publicly available and web-crawled data, and applying Iterative and Incremental Back translation techniques. We strategically select datasets for incremental back translation across multiple small datasets, which is a novel element of our approach. The results of our study show significant improvements, with translation performance for the English-Luganda pair exceeding previous benchmarks by more than 10 BLEU score units across all translation directions. Additionally, our evaluation incorporates comprehensive assessment metrics such as SacreBLEU, ChrF2, and TER, providing a nuanced understanding of translation quality. The conclusion drawn from our research confirms the efficacy of BT when strategically curated datasets are utilized, establishing new performance benchmarks and demonstrating the potential of BT in enhancing NMT models for low-resource languages.

artificial intelligence, machine translation, natural language, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3711542.3711594

2505.02463

Country:

Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.06)
Asia > South Korea (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

PaliGemma-CXR: A Multi-task Multimodal Model for TB Chest X-ray Interpretation

Musinguzi, Denis, Katumba, Andrew, Murindanyi, Sudi

arXiv.org Artificial IntelligenceFeb-28-2025

Tuberculosis (TB) is a infectious global health challenge. Chest X-rays are a standard method for TB screening, yet many countries face a critical shortage of radiologists capable of interpreting these images. Machine learning offers an alternative, as it can automate tasks such as disease diagnosis, and report generation. However, traditional approaches rely on task-specific models, which cannot utilize the interdependence between tasks. Building a multi-task model capable of performing multiple tasks poses additional challenges such as scarcity of multimodal data, dataset imbalance, and negative transfer. To address these challenges, we propose PaliGemma-CXR, a multi-task multimodal model capable of performing TB diagnosis, object detection, segmentation, report generation, and VQA. Starting with a dataset of chest X-ray images annotated with TB diagnosis labels and segmentation masks, we curated a multimodal dataset to support additional tasks. By finetuning PaliGemma on this dataset and sampling data using ratios of the inverse of the size of task datasets, we achieved the following results across all tasks: 90.32% accuracy on TB diagnosis and 98.95% on close-ended VQA, 41.3 BLEU score on report generation, and a mAP of 19.4 and 16.0 on object detection and segmentation, respectively. These results demonstrate that PaliGemma-CXR effectively leverages the interdependence between multiple image interpretation tasks to enhance performance.

dataset, paligemma-cxr, segmentation mask, (12 more...)

arXiv.org Artificial Intelligence

2503.00171

Country:

Africa > Uganda > Western Region > Mbarara District (0.04)
Africa > South Africa (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Learning to Fuse Temporal Proximity Networks: A Case Study in Chimpanzee Social Interactions

He, Yixuan, Sandel, Aaron, Wipf, David, Cucuringu, Mihai, Mitani, John, Reinert, Gesine

arXiv.org Machine LearningJan-31-2025

How can we identify groups of primate individuals which could be conjectured to drive social structure? To address this question, one of us has collected a time series of data for social interactions between chimpanzees. Here we use a network representation, leading to the task of combining these data into a time series of a single weighted network per time stamp, where different proximities should be given different weights reflecting their relative importance. We optimize these proximity-type weights in a principled way, using an innovative loss function which rewards structural consistency across time. The approach is empirically validated by carefully designed synthetic data. Using statistical tests, we provide a way of identifying groups of individuals that stay related for a significant length of time. Applying the approach to the chimpanzee data set, we detect cliques in the animal social network time series, which can be validated by real-world intuition from prior research and qualitative observations by chimpanzee experts.

data mining, machine learning, temporal reasoning, (16 more...)

arXiv.org Machine Learning

2502.00302

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
North America > United States > Texas > Travis County > Austin (0.14)
(7 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Temporal Reasoning (0.41)

Add feedback

Optimizing Vital Sign Monitoring in Resource-Constrained Maternal Care: An RL-Based Restless Bandit Approach

Boehmer, Niclas, Zhao, Yunfan, Xiong, Guojun, Rodriguez-Diaz, Paula, Cibrian, Paola Del Cueto, Ngonzi, Joseph, Boatin, Adeline, Tambe, Milind

arXiv.org Artificial IntelligenceOct-10-2024

Maternal mortality remains a significant global public health challenge. One promising approach to reducing maternal deaths occurring during facility-based childbirth is through early warning systems, which require the consistent monitoring of mothers' vital signs after giving birth. Wireless vital sign monitoring devices offer a labor-efficient solution for continuous monitoring, but their scarcity raises the critical question of how to allocate them most effectively. We devise an allocation algorithm for this problem by modeling it as a variant of the popular Restless Multi-Armed Bandit (RMAB) paradigm. In doing so, we identify and address novel, previously unstudied constraints unique to this domain, which render previous approaches for RMABs unsuitable and significantly increase the complexity of the learning and planning problem. To overcome these challenges, we adopt the popular Proximal Policy Optimization (PPO) algorithm from reinforcement learning to learn an allocation policy by training a policy and value function network. We demonstrate in simulations that our approach outperforms the best heuristic baseline by up to a factor of $4$.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2410.08377

Country:

Africa > Uganda > Western Region > Mbarara District (0.06)
North America > United States > Massachusetts (0.04)
Asia > Middle East > Israel (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)
Health & Medicine > Public Health > Maternal Health (1.00)
Health & Medicine > Diagnostic Medicine > Vital Signs (0.98)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.48)

Add feedback

Democratizing AI in Africa: FL for Low-Resource Edge Devices

Fabila, Jorge, Campello, Víctor M., Martín-Isla, Carlos, Obungoloch, Johnes, Leo, Kinyera, Ronald, Amodoi, Lekadir, Karim

arXiv.org Artificial IntelligenceAug-30-2024

Africa faces significant challenges in healthcare delivery due to limited infrastructure and access to advanced medical technologies. This study explores the use of federated learning to overcome these barriers, focusing on perinatal health. We trained a fetal plane classifier using perinatal data from five African countries: Algeria, Ghana, Egypt, Malawi, and Uganda, along with data from Spanish hospitals. To incorporate the lack of computational resources in the analysis, we considered a heterogeneous set of devices, including a Raspberry Pi and several laptops, for model training. We demonstrate comparative performance between a centralized and a federated model, despite the compute limitations, and a significant improvement in model generalizability when compared to models trained only locally. These results show the potential for a future implementation at a large scale of a federated learning platform to bridge the accessibility gap and improve model generalizability with very little requirements.

dataset, democratizing ai, node, (16 more...)

arXiv.org Artificial Intelligence

2408.17216

Country:

Africa > Ghana (0.26)
Africa > Middle East > Egypt (0.26)
Africa > Middle East > Algeria (0.26)
(6 more...)

Genre: Research Report > New Finding (0.49)

Industry:

Information Technology (1.00)
Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Providers & Services (0.89)
Health & Medicine > Diagnostic Medicine > Imaging (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation

Kloker, Simon, Bukoli, Herbertson, Kateete, Twaha

arXiv.org Artificial IntelligenceAug-14-2024

Introduction: Poor educational quality in Secondary Schools is still regarded as one of the major struggles in 21st century Uganda - especially in rural areas. Research identifies several problems, including low quality or absent teacher lesson planning. As the government pushes towards the implementation of a new curriculum, exiting lesson plans become obsolete and the problem is worsened. Using a Retrieval Augmented Generation approach, we developed a prototype that generates customized lesson plans based on the government-accredited textbooks. This helps teachers create lesson plans more efficiently and with better quality, ensuring they are fully aligned the new curriculum and the competence-based learning approach. Methods: The prototype was created using Cohere LLM and Sentence Embeddings, and LangChain Framework - and thereafter made available on a public website. Vector stores were trained for three new curriculum textbooks (ICT, Mathematics, History), all at Secondary 1 Level. Twenty-four lessons plans were generated following a pseudo-random generation protocol, based on the suggested periods in the textbooks. The lesson plans were analyzed regarding their technical quality by three independent raters following the Lesson Plan Analysis Protocol (LPAP) by Ndihokubwayo et al. (2022) that is specifically designed for East Africa and competence-based curriculums. Results: Evaluation of 24 lesson plans using the LPAP resulted in an average quality of between 75 and 80%, corresponding to "very good lesson plan". None of the lesson plans scored below 65%, although one lesson plan could be argued to have been missing the topic. In conclusion, the quality of the generated lesson plans is at least comparable, if not better, than those created by humans, as demonstrated in a study in Rwanda, whereby no lesson plan even reached the benchmark of 50%.

lesson plan, lesson planning, ndejje university, (13 more...)

arXiv.org Artificial Intelligence

2408.07542

Country:

Africa > East Africa (0.24)
Africa > Rwanda (0.24)
North America > United States (0.04)
(4 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Curriculum (1.00)
Education > Educational Setting > K-12 Education > Secondary School (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

Hansen, Lasse Hyldig, Andersen, Nikolaj, Gallifant, Jack, McCoy, Liam G., Stone, James K, Izath, Nura, Aguirre-Jerez, Marcela, Bitterman, Danielle S, Gichoya, Judy, Celi, Leo Anthony

arXiv.org Artificial IntelligenceMay-8-2024

Background Advancements in Large Language Models (LLMs) hold transformative potential in healthcare, however, recent work has raised concern about the tendency of these models to produce outputs that display racial or gender biases. Although training data is a likely source of such biases, exploration of disease and demographic associations in text data at scale has been limited. Methods We conducted a large-scale textual analysis using a dataset comprising diverse web sources, including Arxiv, Wikipedia, and Common Crawl. The study analyzed the context in which various diseases are discussed alongside markers of race and gender. Given that LLMs are pre-trained on similar datasets, this approach allowed us to examine the potential biases that LLMs may learn and internalize. We compared these findings with actual demographic disease prevalence as well as GPT-4 outputs in order to evaluate the extent of bias representation. Results Our findings indicate that demographic terms are disproportionately associated with specific disease concepts in online texts. gender terms are prominently associated with disease concepts, while racial terms are much less frequently associated. We find widespread disparities in the associations of specific racial and gender terms with the 18 diseases analyzed. Most prominently, we see an overall significant overrepresentation of Black race mentions in comparison to population proportions. Conclusions Our results highlight the need for critical examination and transparent reporting of biases in LLM pretraining datasets. Our study suggests the need to develop mitigation strategies to counteract the influence of biased training data in LLMs, particularly in sensitive domains such as healthcare.

dataset, disease mention, keyword, (14 more...)

arXiv.org Artificial Intelligence

2405.05049

Country:

North America > Canada > Alberta (0.14)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
Africa > Uganda > Western Region > Mbarara District (0.04)
(8 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Add feedback

Malaria infection and severe disease risks in Africa

ScienceAug-19-2021, 17:44:05 GMT

Understanding how changes in community parasite prevalence alter the rate and age distribution of severe malaria is essential for optimizing control efforts. Paton et al. assessed the incidence of pediatric severe malaria admissions from 13 hospitals in East Africa from 2006 to 2020 (see the Perspective by Taylor and Slutsker). Each 25% increase in community parasite prevalence shifted hospital admissions toward younger children. Low rates of lifetime infections appeared to confer some immunity to severe malaria in very young children. Children under the age of 5 years thus need to remain a focus of disease prevention for malaria control. Science , abj0089, this issue p. [926][1]; see also abk3443, p. [855][2] The relationship between community prevalence of Plasmodium falciparum and the burden of severe, life-threatening disease remains poorly defined. To examine the three most common severe malaria phenotypes from catchment populations across East Africa, we assembled a dataset of 6506 hospital admissions for malaria in children aged 3 months to 9 years from 2006 to 2020. Admissions were paired with data from community parasite infection surveys. A Bayesian procedure was used to calibrate uncertainties in exposure (parasite prevalence) and outcomes (severe malaria phenotypes). Each 25% increase in prevalence conferred a doubling of severe malaria admission rates. Severe malaria remains a burden predominantly among young children (3 to 59 months) across a wide range of community prevalence typical of East Africa. This study offers a quantitative framework for linking malaria parasite prevalence and severe disease outcomes in children. [1]: /lookup/doi/10.1126/science.abj0089 [2]: /lookup/doi/10.1126/science.abk3443

community parasite prevalence, parasite prevalence, prevalence, (14 more...)

Science

Country:

Africa > East Africa (0.66)
Africa > Uganda > Western Region > Kabale District (0.04)
Africa > Tanzania (0.04)
(3 more...)

Genre: Research Report (0.68)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology: Information Technology > Artificial Intelligence (0.31)

Add feedback

Chimps get fussier about who their friends are as they get older - just like humans do

Daily Mail - Science & techOct-22-2020, 18:03:30 GMT

Chimpanzees get more selective over who they associate themselves with as they age, new research reveals. In a study spanning two decades in a Ugandan national park, US experts observed social interactions among 21 wild male chimps, ranging in age from 15 to 58 years. Both chimps and humans prefer to be around the company of old friends and spend less time among new faces, the experts conclude. Ageing male chimps have more mutual and positive friendships than younger chimps, who have more one-sided, antagonistic relationships. Chimps also showed a shift from negative interactions to more positive ones as they reached their twilight years, 'like humans looking for some peace and quiet'.

artificial intelligence, chimp, chimpanzee, (17 more...)

Daily Mail - Science & tech

Country:

North America > United States > Michigan (0.05)
North America > United States > Massachusetts (0.05)
North America > United States > Arizona (0.05)
Africa > Uganda > Western Region > Kibaale District (0.05)

Genre: Research Report > New Finding (0.71)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.38)

Add feedback

Ugandan medics deploy AI to stop women dying after childbirth

#artificialintelligenceJan-31-2020, 18:15:02 GMT

NAIROBI, Jan 31 (Thomson Reuters Foundation) - Ugandan doctors are giving new mothers artificial intelligence-enabled devices to remotely monitor their health in a first-of-its-kind study aiming to curb thousands of preventable maternal deaths across Africa, medics and developers said. Doctors at Mbarara Hospital in western Uganda will give devices to more than 1,000 women who have undergone caesarean section births to wear on their upper arms at all times. Algorithms detect at-risk cases and alert doctors. Joseph Ngonzi from Mbarara University of Science and Technology, which is conducting the study, said it would help "improve monitoring in a resource-constrained environment". The World Health Organization says almost 300,000 women worldwide die annually from preventable causes related to pregnancy and childbirth - that's more than 800 women every day.

childbirth, stop woman, ugandan medic deploy ai, (4 more...)

#artificialintelligence

Country:

Africa > Uganda > Western Region > Mbarara District (0.49)
Africa > Kenya > Nairobi City County > Nairobi (0.27)
North America > United States > New York (0.07)
Africa > Sub-Saharan Africa (0.07)

Industry: Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)

Technology: Information Technology > Artificial Intelligence > Applied AI (0.40)

Add feedback