AITopics | Harare

Harare

US's new scramble for Africa is biomedical imperialism

Al Jazeera | Mar-13-2026, 14:16:01 GMT

Late in February, Zimbabwe pulled out of a proposed $367m United States health funding agreement after objecting to provisions requiring broad American access to sensitive health data. The five-year programme was presented as support for HIV/AIDS, tuberculosis, malaria and epidemic preparedness efforts. However, the terms demanded extensive sharing of national health intelligence, including epidemiological surveillance data and pathogen samples, while offering no binding guarantees that Zimbabwe would receive equitable access to medical technologies developed from them. Harare called the proposal an "unequal exchange", warning that Zimbabwe risked supplying the "raw materials for scientific discovery" while the resulting benefits could remain concentrated in the United States and global pharmaceutical firms. Critics increasingly describe this pattern as biomedical extractivism: a toxic combination of exploitative research practices and colonial thinking that reinforces Western dominance.

artificial intelligence, bioinformatics, united states, (10 more...)

Al Jazeera

Country:

North America > United States (1.00)
South America (0.41)
North America > Central America (0.41)
(11 more...)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Biomedical Informatics (0.37)
Information Technology > Artificial Intelligence (0.35)

Shona spaCy: A Morphological Analyzer for an Under-Resourced Bantu Language

Masoka, Happymore

arXiv.org Artificial Intelligence | Nov-24-2025

Despite rapid advances in multilingual natural language processing (NLP), the Bantu language Shona remains under-served in terms of morphological analysis and language-aware tools. This paper presents Shona spaCy, an open-source, rule-based morphological pipeline for Shona built on the spaCy framework. The system combines a curated JSON lexicon with linguistically grounded rules to model noun-class prefixes (Mupanda 1-18), verbal subject concords, tense-aspect markers, ideophones, and clitics, integrating these into token-level annotations for lemma, part-of-speech, and morphological features. The toolkit is available via pip install shona-spacy, with source code at https://github.com/HappymoreMasoka/shona-spacy and a PyPI release at https://pypi.org/project/shona-spacy/0.1.4/. Evaluation on formal and informal Shona corpora yields 90% POS-tagging accuracy and 88% morphological-feature accuracy, while maintaining transparency in its linguistic decisions. By bridging descriptive grammar and computational implementation, Shona spaCy advances NLP accessibility and digital inclusion for Shona speakers and provides a template for morphological analysis tools for other under-resourced Bantu languages.
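
The lexicon-plus-rules idea described in this abstract can be sketched roughly as follows. This is a minimal illustration and not the shona-spacy API: the names PREFIX_RULES, LEXICON, and analyze are hypothetical, the lexicon holds a single entry, and only four noun-class prefixes are covered (mu- class 1, va- class 2, chi- class 7, zvi- class 8). Real Shona prefixes are ambiguous (mu- also marks classes 3 and 18), which is presumably why a curated lexicon is consulted before the rules.

```python
# Illustrative sketch only: NOT the shona-spacy package API.
# All names and the tiny lexicon below are hypothetical.

# Longer prefixes come first so "zvi" is tried before shorter ones.
PREFIX_RULES = [
    ("zvi", "8"),
    ("chi", "7"),
    ("mu", "1"),   # simplification: mu- can also mark classes 3 and 18
    ("va", "2"),
]

# A curated lexicon takes priority over prefix rules.
LEXICON = {
    "munhu": {"lemma": "munhu", "pos": "NOUN", "noun_class": "1"},
}


def analyze(token):
    """Return a token-level annotation dict for one word."""
    word = token.lower()
    if word in LEXICON:
        return dict(LEXICON[word], source="lexicon")
    for prefix, noun_class in PREFIX_RULES:
        # Require at least one character of stem after the prefix.
        if word.startswith(prefix) and len(word) > len(prefix):
            return {
                "lemma": word,
                "pos": "NOUN",
                "noun_class": noun_class,
                "source": "rule",
            }
    # Nothing matched: emit an unknown tag instead of guessing.
    return {"lemma": word, "pos": "X", "noun_class": None, "source": "fallback"}
```

Recording a source field on each annotation is one simple way to keep the analyzer's decisions transparent, since every tag can be traced back to a lexicon entry or a named rule.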

artificial intelligence, natural language, shona, (18 more...)

arXiv.org Artificial Intelligence

2511.1668

Country:

Africa > Zimbabwe > Harare > Harare (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Africa > South Africa > Western Cape > Cape Town (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.52)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.47)

Mechanistic Interpretability with SAEs: Probing Religion, Violence, and Geography in Large Language Models

Simbeck, Katharina, Mahran, Mariam

arXiv.org Artificial Intelligence | Sep-23-2025

Despite growing research on bias in large language models (LLMs), most work has focused on gender and race, with little attention to religious identity. This paper explores how religion is internally represented in LLMs and how it intersects with concepts of violence and geography. Using mechanistic interpretability and Sparse Autoencoders (SAEs) via the Neuronpedia API, we analyze latent feature activations across five models. We measure overlap between religion- and violence-related prompts and probe semantic patterns in activation contexts. While all five religions show comparable internal cohesion, Islam is more frequently linked to features associated with violent language. In contrast, geographic associations largely reflect real-world religious demographics, revealing how models embed both factual distributions and cultural stereotypes. These findings highlight the value of structural analysis in auditing not just outputs but also internal representations that shape model behavior.
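
One simple way to quantify overlap between two sets of activated SAE features is the Jaccard index. The abstract does not say which overlap measure the authors use, so the sketch below is an assumption, and the feature indices are invented placeholders rather than real Neuronpedia feature IDs.

```python
def jaccard_overlap(features_a, features_b):
    """Jaccard index: size of the intersection divided by size of the union."""
    a, b = set(features_a), set(features_b)
    union = a | b
    if not union:
        return 0.0  # two empty sets: define the overlap as zero
    return len(a & b) / len(union)


# Hypothetical feature indices, invented for illustration only.
religion_features = {101, 205, 317, 422}
violence_features = {205, 422, 518}

overlap = jaccard_overlap(religion_features, violence_features)
print(f"Jaccard overlap: {overlap:.2f}")
```

With these placeholder sets, two of the five distinct features are shared, so the overlap is 0.40.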

large language model, machine learning, religion, (18 more...)

arXiv.org Artificial Intelligence

2509.17665

Country:

North America > United States > New York > New York County > New York City (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.14)
(225 more...)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts

Tighidet, Zineddine, Mogini, Andrea, Ben-younes, Hedi, Mei, Jiali, Gallinari, Patrick, Piwowarski, Benjamin

arXiv.org Artificial Intelligence | Sep-18-2025

The behavior of Large Language Models (LLMs) when facing contextual information that conflicts with their internal parametric knowledge is inconsistent, with no generally accepted explanation for the expected outcome distribution. Recent work has identified in autoregressive transformer models a class of neurons -- called entropy neurons -- that produce a significant effect on the model output entropy while having an overall moderate impact on the ranking of the predicted tokens. In this paper, we investigate the preliminary claim that these neurons are involved in inhibiting context copying behavior in transformers by looking at their role in resolving conflicts between contextual and parametric information. We show that entropy neurons are responsible for suppressing context copying across a range of LLMs, and that ablating them leads to a significant change in the generation process. These results enhance our understanding of the internal dynamics of LLMs when handling conflicting information.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2509.10663

Country:

Europe > Italy (0.05)
Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States > Oregon > Harney County (0.04)
(10 more...)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Meet Britain's real-life SUPERVILLAIN: Eccentric millionaire lives in a bunker beneath a Cold War radar - and is convinced he's going to find UFOs

Daily Mail - Science & tech | Aug-17-2025, 13:24:45 GMT

Some millionaires might be happy frittering away their hard-earned cash on speed boats, golfing holidays, and perhaps the odd football team or two. But William Sachiti is far from your run-of-the-mill businessman. Much more Blofeld than Bill Gates, Mr Sachiti has decided to use his millions in a far less conventional way. Naturally, that meant buying a Cold War RAF base and firing up the radar station to hunt for UFOs. From his 'supervillain lair' in the nuclear bunker beneath former RAF Neatishead, Norfolk, Mr Sachiti is building what may be the world's most sophisticated UFO-hunting machine.

radar station, raf neatishead, sachiti, (15 more...)

Daily Mail - Science & tech

Country:

Europe > United Kingdom (0.84)
Africa > Zimbabwe > Harare > Harare (0.04)

Industry: Government > Military (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.97)

A Deep Learning Automatic Speech Recognition Model for Shona Language

Sirora, Leslie Wellington, Mutandavari, Mainford

arXiv.org Artificial Intelligence | Jul-30-2025

This study presented the development of a deep learning-based Automatic Speech Recognition system for Shona, a low-resource language characterized by unique tonal and grammatical complexities. The research aimed to address the challenges posed by limited training data, lack of labelled data, and the intricate tonal nuances present in Shona speech, with the objective of achieving significant improvements in recognition accuracy compared to traditional statistical models. The research first explored the feasibility of using deep learning to develop an accurate ASR system for Shona. Second, it investigated the specific challenges involved in designing and implementing deep learning architectures for Shona speech recognition and proposed strategies to mitigate these challenges. Lastly, it compared the performance of the deep learning-based model with existing statistical models in terms of accuracy. The developed ASR system utilized a hybrid architecture consisting of a Convolutional Neural Network for acoustic modelling and a Long Short-Term Memory network for language modelling. To overcome the scarcity of data, data augmentation techniques and transfer learning were employed. Attention mechanisms were also incorporated to accommodate the tonal nature of Shona speech. The resulting ASR system achieved impressive results, with a Word Error Rate of 29%, Phoneme Error Rate of 12%, and an overall accuracy of 74%. These metrics indicated the potential of deep learning to enhance ASR accuracy for under-resourced languages like Shona. This study contributed to the advancement of ASR technology for under-resourced languages like Shona, ultimately fostering improved accessibility and communication for Shona speakers worldwide.
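
For readers unfamiliar with the Word Error Rate reported above, the metric is conventionally the word-level edit distance (substitutions, deletions, insertions) between the reference transcript and the system output, divided by the number of reference words. The sketch below shows that standard definition. It is not the authors' evaluation code, and the function name and example strings are placeholders.

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level Levenshtein distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Placeholder tokens, not real transcripts: one substitution and one deletion
# against a four-word reference give a WER of 2 / 4.
print(word_error_rate("a b c d", "a x c"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.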

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.15680/IJIRCCE.2024.1206001

2507.21331

Country:

Africa > Zimbabwe > Harare > Harare (0.05)
North America > United States > New York (0.04)
North America > United States > Illinois (0.04)
(5 more...)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models

Dihan, Mahir Labib, Hassan, Md Tanvir, Parvez, Md Tanvir, Hasan, Md Hasebul, Alam, Md Almash, Cheema, Muhammad Aamir, Ali, Mohammed Eunus, Parvez, Md Rizwan

arXiv.org Artificial Intelligence | Dec-31-2024

Recent advancements in foundation models have enhanced AI systems' capabilities in autonomous tool usage and reasoning. However, their ability in location or map-based reasoning - which improves daily life by optimizing navigation, facilitating resource discovery, and streamlining logistics - has not been systematically studied. To bridge this gap, we introduce MapEval, a benchmark designed to assess diverse and complex map-based user queries with geo-spatial reasoning. MapEval features three task types (textual, API-based, and visual) that require collecting world information via map tools, processing heterogeneous geo-spatial contexts (e.g., named entities, travel distances, user reviews or ratings, images), and compositional reasoning, which all state-of-the-art foundation models find challenging. Comprising 700 unique multiple-choice questions about locations across 180 cities and 54 countries, MapEval evaluates foundation models' ability to handle spatial relationships, map infographics, travel planning, and navigation challenges. Using MapEval, we conducted a comprehensive evaluation of 28 prominent foundation models. While no single model excelled across all tasks, Claude-3.5-Sonnet, GPT-4o, and Gemini-1.5-Pro achieved competitive performance overall. However, substantial performance gaps emerged, particularly in MapEval, where agents with Claude-3.5-Sonnet outperformed GPT-4o and Gemini-1.5-Pro by 16% and 21%, respectively, and the gaps became even more amplified when compared to open-source LLMs. Our detailed analyses provide insights into the strengths and weaknesses of current models, though all models still fall short of human performance by more than 20% on average, struggling with complex map images and rigorous geo-spatial reasoning. This gap highlights MapEval's critical role in advancing general-purpose foundation models with stronger geo-spatial understanding.

gemini-1, information, llama-3, (15 more...)

arXiv.org Artificial Intelligence

2501.00316

Country:

North America > United States > New York > New York County > New York City (0.14)
Asia > Malaysia > Kuala Lumpur > Kuala Lumpur (0.05)
Europe > Finland > Uusimaa > Helsinki (0.05)
(53 more...)

Genre: Research Report > New Finding (0.45)

Industry:

Transportation > Infrastructure & Services (1.00)
Consumer Products & Services > Travel (1.00)
Leisure & Entertainment (0.93)
Education (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Measuring Bias of Web-filtered Text Datasets and Bias Propagation Through Training

Mansour, Youssef, Heckel, Reinhard

arXiv.org Artificial Intelligence | Dec-3-2024

We investigate biases in pretraining datasets for large language models (LLMs) through dataset classification experiments. Building on prior work demonstrating the existence of biases in popular computer vision datasets, we analyze popular open-source pretraining datasets for LLMs derived from CommonCrawl including C4, RefinedWeb, DolmaCC, RedPajama-V2, FineWeb, and DCLM-Baseline. Despite those datasets being obtained with similar filtering and deduplication steps, neural networks can classify surprisingly well which dataset a single text sequence belongs to, significantly better than a human can. This indicates that popular pretraining datasets have their own unique biases or fingerprints. Those biases remain even when the text is rewritten with LLMs. Moreover, these biases propagate through training: Random sequences generated by models trained on those datasets can be classified well by a classifier trained on the original datasets.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2412.02857

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > Mexico (0.04)
(6 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Materials > Chemicals (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Law Enforcement & Public Safety (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Probing Language Models on Their Knowledge Source

Tighidet, Zineddine, Mogini, Andrea, Mei, Jiali, Piwowarski, Benjamin, Gallinari, Patrick

arXiv.org Artificial Intelligence | Nov-9-2024

Large Language Models (LLMs) often encounter conflicts between their learned, internal knowledge (parametric knowledge, PK) and external knowledge provided during inference (contextual knowledge, CK). Understanding how LLMs prioritize one knowledge source over the other remains a challenge. In this paper, we propose a novel probing framework to explore the mechanisms governing the selection between PK and CK in LLMs. Using controlled prompts designed to contradict the model's PK, we demonstrate that specific model activations are indicative of the knowledge source employed. We evaluate this framework on various LLMs of different sizes and demonstrate that mid-layer activations, particularly those related to relations in the input, are crucial in predicting knowledge source selection, paving the way for more reliable models capable of handling knowledge conflicts effectively.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2410.05817

Country:

Europe > Croatia (0.14)
North America > United States > Virginia (0.05)
Europe > Italy (0.05)
(17 more...)

Genre: Research Report > New Finding (0.68)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives

Ma, Lei, Yan, Ziyun, Li, Mengmeng, Liu, Tao, Tan, Liqin, Wang, Xuan, He, Weiqiang, Wang, Ruikun, He, Guangjun, Lu, Heng, Blaschke, Thomas

arXiv.org Artificial Intelligence | Aug-2-2024

Deep learning has gained significant attention in remote sensing, especially in pixel- or patch-level applications. Despite initial attempts to integrate deep learning into object-based image analysis (OBIA), its full potential remains largely unexplored. In this article, as OBIA usage becomes more widespread, we conducted a comprehensive review and expansion of its task subdomains, with or without the integration of deep learning. Furthermore, we have identified and summarized five prevailing strategies to address the challenge of deep learning's limitations in directly processing unstructured object data within OBIA, and this review also recommends some important future research directions. Our goal with these endeavors is to inspire more exploration in this fascinating yet overlooked area and facilitate the integration of deep learning into OBIA processing workflows.

classification, remote sensing, segmentation, (15 more...)

arXiv.org Artificial Intelligence

2408.01607

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Middle East > Iraq > Kurdistan Region (0.14)
South America > Brazil (0.04)
(23 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.45)

Industry:

Government (0.67)
Health & Medicine (0.46)
Food & Agriculture > Agriculture (0.45)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.43)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)