Bamako
Three West African juntas have turned to Russia. Now the US wants to engage them
Three West African juntas have turned to Russia. The US has declared a stark policy shift towards three West African countries which are battling Islamist insurgents and whose military governments have broken defence ties with France and turned towards Russia. The state department announced that Nick Checker, head of its Bureau of African Affairs, would visit Mali's capital Bamako to convey the United States' respect for Mali's sovereignty and chart a new course in relations, moving past policy missteps. It adds that the US also looks forward to co-operating with Mali's allies, neighbouring Burkina Faso and Niger, on shared security and economic interests. Absent from the agenda is the longstanding American concern for democracy and human rights.
The Serendipity of Claude AI: Case of the 13 Low-Resource National Languages of Mali
Dembele, Alou, Coulibaly, Nouhoum Souleymane, Leventhal, Michael
However, most of the world's languages, often referred to as "low-resource languages", still remain either not supported or insufficiently supported due to the limited availability of data and language resources, and market, economic, and global inequality factors. Mali, a multilingual country with 13 official languages, including Bamanankan (Bambara), Bomu, Bozo, Dɔgɔsɔ (Dogon), Fulfulde (Fula), Hassaniya Arabic, Mamara (Minyanka), Maninka, Soninke, Sɔõɔy (Songhay), Senara, Tàmàsàyt (Tamasheq) and Xaasongaxanno (Kassonke), faces severe challenges in digital inclusion limiting economic development, educational advancement, and preservation of cultural heritage (Bird, 2020; Nekoto et al., 2020). These languages share in common a penury of language resources needed to train AI and NLP systems which could play a role in lessening the digital divide (Hammarström et al., 2018). This penury extends from severe in the case of a language like Bambara which has very limited resources to catastrophic for languages like Bomu and Bozo with an almost complete absence of language resources. The need for innovative methods for low-resource languages has spawned varied strategies, such as transfer learning, zero-shot learning, and pre-trained models in related languages (Ruder, 2021; Adelani et al., 2022).
Controllable Context Sensitivity and the Knob Behind It
Minder, Julian, Du, Kevin, Stoehr, Niklas, Monea, Giovanni, Wendler, Chris, West, Robert, Cotterell, Ryan
When making predictions, a language model must trade off how much it relies on its context vs. its prior knowledge. Choosing how sensitive the model is to its context is a fundamental functionality, as it enables the model to excel at tasks like retrieval-augmented generation and question-answering. In this paper, we search for a knob which controls this sensitivity, determining whether language models answer from the context or their prior knowledge. To guide this search, we design a task for controllable context sensitivity. In this task, we first feed the model a context (Paris is in England) and a question (Where is Paris?); we then instruct the model to either use its prior or contextual knowledge and evaluate whether it generates the correct answer for both intents (either France or England). When fine-tuned on this task, instruction-tuned versions of Llama-3.1, Mistral-v0.3, and Gemma-2 can solve it with high accuracy (85-95%). Analyzing these high-performing models, we narrow down which layers may be important to context sensitivity using a novel linear time algorithm. Then, in each model, we identify a 1-D subspace in a single layer that encodes whether the model follows context or prior knowledge. Interestingly, while we identify this subspace in a fine-tuned model, we find that the exact same subspace serves as an effective knob in not only that model but also non-fine-tuned instruct and base models of that model family. Finally, we show a strong correlation between a model's performance and how distinctly it separates context-agreeing from context-ignoring answers in this subspace. These results suggest a single subspace facilitates how the model chooses between context and prior knowledge, hinting at a simple fundamental mechanism that controls this behavior.
Machine Translation for Nko: Tools, Corpora and Baseline Results
Doumbouya, Moussa Koulako Bala, Diané, Baba Mamadi, Cissé, Solo Farabado, Diané, Djibrila, Sow, Abdoulaye, Doumbouya, Séré Moussa, Bangoura, Daouda, Bayo, Fodé Moriba, Condé, Ibrahima Sory 2., Diané, Kalo Mory, Piech, Chris, Manning, Christopher
Unfortunately, to over 40 million people across West African countries date, there isn't any usable machine translation including Mali, Guinea, Ivory Coast, Gambia, (MT) system for Nko, in part due to the unavailability Burkina Faso, Sierra Leone, Senegal, Liberia, and of large text corpora required by state-of-the-art Guinea-Bissau. Nko, which means'I say' in all neural machine translation (NMT) algorithms. Manding languages, was developed as both the Nko is a representative case study of the broader Manding literary standard language and a writing issues that interfere with the goal of universal machine system by Soulemana Kanté in 1949 for the translation. Thousands of languages still purpose of sustaining the strong oral tradition of don't have available or usable MT systems, mainly Manding languages (Niane, 1974; Conde, 2017; due to the unavailability of high-quality parallel Eberhard et al., 2023).
Predicting Question-Answering Performance of Large Language Models through Semantic Consistency
Rabinovich, Ella, Ackerman, Samuel, Raz, Orna, Farchi, Eitan, Anaby-Tavor, Ateret
Semantic consistency of a language model is broadly defined as the model's ability to produce semantically-equivalent outputs, given semantically-equivalent inputs. We address the task of assessing question-answering (QA) semantic consistency of contemporary large language models (LLMs) by manually creating a benchmark dataset with high-quality paraphrases for factual questions, and release the dataset to the community. We further combine the semantic consistency metric with additional measurements suggested in prior work as correlating with LLM QA accuracy, for building and evaluating a framework for factual QA reference-less performance prediction -- predicting the likelihood of a language model to accurately answer a question. Evaluating the framework on five contemporary LLMs, we demonstrate encouraging, significantly outperforming baselines, results.
Flickr Africa: Examining Geo-Diversity in Large-Scale, Human-Centric Visual Data
Naggita, Keziah, LaChance, Julienne, Xiang, Alice
Biases in large-scale image datasets are known to influence the performance of computer vision models as a function of geographic context. To investigate the limitations of standard Internet data collection methods in low- and middle-income countries, we analyze human-centric image geo-diversity on a massive scale using geotagged Flickr images associated with each nation in Africa. We report the quantity and content of available data with comparisons to population-matched nations in Europe as well as the distribution of data according to fine-grained intra-national wealth estimates. Temporal analyses are performed at two-year intervals to expose emerging data trends. Furthermore, we present findings for an ``othering'' phenomenon as evidenced by a substantial number of images from Africa being taken by non-local photographers. The results of our study suggest that further work is required to capture image data representative of African people and their environments and, ultimately, to improve the applicability of computer vision models in a global context.
Spectroscopy and Chemometrics/Machine-Learning News Weekly #6, 2023 – [:en]NIR Calibration Model[:de]NIR Calibration Model[:it]Modelli di Calibrazione NIR
Get the Spectroscopy and Chemometrics News Weekly in real time on Twitter @ CalibModel and follow us. "Component Prediction of Antai Pills Based on One-Dimensional Convolutional Neural Network and Near-Infrared Spectroscopy" LINK "Moisture content monitoring in withering leaves during black tea processing based on electronic eye and near infrared spectroscopy" LINK "Hyperspectral technique combined with stacking and blending ensemble learning method for detection of cadmium content in oilseed rape leaves" LINK "Capacitance spectroscopy enables realtime monitoring of early cell death in mammalian cell culture" LINK "Detection of bruised loquats based on reflectance, absorbance and Kubelka-Munk spectra" LINK "Longitudinal alterations of pulmonary [… formula…] O2 on-kinetics during moderate-intensity exercise in competitive youth cyclists are related to alterations in the …" LINK
Modelling spatio-temporal trends of air pollution in Africa
Gahungu, Paterne, Kubwimana, Jean Remy, Muhimpundu, Lionel Jean Marie Benjamin, Ndamuzi, Egide
Atmospheric pollution remains one of the major public health threat worldwide with an estimated 7 millions deaths annually. In Africa, rapid urbanization and poor transport infrastructure are worsening the problem. In this paper, we have analysed spatio-temporal variations of PM2.5 across different geographical regions in Africa. The West African region remains the most affected by the high levels of pollution with a daily average of 40.856 $\mu g/m^3$ in some cities like Lagos, Abuja and Bamako. In East Africa, Uganda is reporting the highest pollution level with a daily average concentration of 56.14 $\mu g/m^3$ and 38.65 $\mu g/m^3$ for Kigali. In countries located in the central region of Africa, the highest daily average concentration of PM2.5 of 90.075 $\mu g/m^3$ was recorded in N'Djamena. We compare three data driven models in predicting future trends of pollution levels. Neural network is outperforming Gaussian processes and ARIMA models.
Artificial intelligence locates "invisible" water in Mali and Chad
Using algorithms and artificial intelligence, a research team led by Universidad Complutense de Madrid (UCM) has designed a tool which, in its initial trials, proved capable of predicting those areas with best access to potable groundwater in Africa, with a success rate of close to 90%. In specific terms, the papers published in Hydrology and Earth System Science and Geocarto International describe the hydrogeological mapping performed by the MLMapper software in the regions of Bamako and Koulikoro (Mali) and the region of Ouaddaï (Chad), respectively. "Ensure access to water and sanitation for all" is Sustainable Development Goal 6. In sub-Saharan Africa, groundwater plays a fundamental role in the supply of drinking water, but the percentage of wells that strike water is very often lower than 30%. "This is mainly because of a lack of hydrogeological knowledge, with the practical consequence that millions of euros of humanitarian aid are lost in fruitless drilling operations", underlines Víctor Gómez-Escalonilla Canales, a researcher at UCM's Department of Geodynamics, Stratigraphy and Palaeontology.