Niger
EAP-GP: Mitigating Saturation Effect in Gradient-based Automated Circuit Identification
Zhang, Lin, Dong, Wenshuo, Zhang, Zhuoran, Yang, Shu, Hu, Lijie, Liu, Ninghao, Zhou, Pan, Wang, Di
Understanding the internal mechanisms of transformer-based language models remains challenging. Mechanistic interpretability based on circuit discovery aims to reverse engineer neural networks by analyzing their internal processes at the level of computational subgraphs. In this paper, we revisit existing gradient-based circuit identification methods and find that their performance is either affected by the zero-gradient problem or saturation effects, where edge attribution scores become insensitive to input changes, resulting in noisy and unreliable attribution evaluations for circuit components. To address the saturation effect, we propose Edge Attribution Patching with GradPath (EAP-GP), EAP-GP introduces an integration path, starting from the input and adaptively following the direction of the difference between the gradients of corrupted and clean inputs to avoid the saturated region. This approach enhances attribution reliability and improves the faithfulness of circuit identification. We evaluate EAP-GP on 6 datasets using GPT-2 Small, GPT-2 Medium, and GPT-2 XL. Experimental results demonstrate that EAP-GP outperforms existing methods in circuit faithfulness, achieving improvements up to 17.7%. Comparisons with manually annotated ground-truth circuits demonstrate that EAP-GP achieves precision and recall comparable to or better than previous approaches, highlighting its effectiveness in identifying accurate circuits.
Feriji: A French-Zarma Parallel Corpus, Glossary & Translator
Keita, Mamadou K., Ibrahim, Elysabhete Amadou, Alfari, Habibatou Abdoulaye, Homan, Christopher
Machine translation (MT) is a rapidly expanding field that has experienced significant advancements in recent years with the development of models capable of translating multiple languages with remarkable accuracy. However, the representation of African languages in this field still needs to improve due to linguistic complexities and limited resources. This applies to the Zarma language, a dialect of Songhay (of the Nilo-Saharan language family) spoken by over 5 million people across Niger and neighboring countries \cite{lewis2016ethnologue}. This paper introduces Feriji, the first robust French-Zarma parallel corpus and glossary designed for MT. The corpus, containing 61,085 sentences in Zarma and 42,789 in French, and a glossary of 4,062 words represent a significant step in addressing the need for more resources for Zarma. We fine-tune three large language models on our dataset, obtaining a BLEU score of 30.06 on the best-performing model. We further evaluate the models on human judgments of fluency, comprehension, and readability and the importance and impact of the corpus and models. Our contributions help to bridge a significant language gap and promote an essential and overlooked indigenous African language.
Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers
Berger, Alexander H., Lux, Laurin, Shit, Suprosanna, Ezhov, Ivan, Kaissis, Georgios, Menten, Martin J., Rueckert, Daniel, Paetzold, Johannes C.
Direct image-to-graph transformation is a challenging task that solves object detection and relationship prediction in a single model. Due to the complexity of this task, large training datasets are rare in many domains, which makes the training of large networks challenging. This data sparsity necessitates the establishment of pre-training strategies akin to the state-of-the-art in computer vision. In this work, we introduce a set of methods enabling cross-domain and cross-dimension transfer learning for image-to-graph transformers. We propose (1) a regularized edge sampling loss for sampling the optimal number of object relationships (edges) across domains, (2) a domain adaptation framework for image-to-graph transformers that aligns features from different domains, and (3) a simple projection function that allows us to pretrain 3D transformers on 2D input data. We demonstrate our method's utility in cross-domain and cross-dimension experiments, where we pretrain our models on 2D satellite images before applying them to vastly different target domains in 2D and 3D. Our method consistently outperforms a series of baselines on challenging benchmarks, such as retinal or whole-brain vessel graph extraction.
Machine Intelligence in Africa: a survey
Tapo, Allahsera Auguste, Traore, Ali, Danioko, Sidy, Tembine, Hamidou
In the last 5 years, the availability of large audio datasets in African countries has opened unlimited opportunities to build machine intelligence (MI) technologies that are closer to the people and speak, learn, understand, and do businesses in local languages, including for those who cannot read and write. Unfortunately, these audio datasets are not fully exploited by current MI tools, leaving several Africans out of MI business opportunities. Additionally, many state-of-the-art MI models are not culture-aware, and the ethics of their adoption indexes are questionable. The lack thereof is a major drawback in many applications in Africa. This paper summarizes recent developments in machine intelligence in Africa from a multi-layer multiscale and culture-aware ethics perspective, showcasing MI use cases in 54 African countries through 400 articles on MI research, industry, government actions, as well as uses in art, music, the informal economy, and small businesses in Africa. The survey also opens discussions on the reliability of MI rankings and indexes in the African continent as well as algorithmic definitions of unclear terms used in MI.
US military resumes drone, crewed aircraft operations in post-coup Niger
The United States military has resumed operations in Niger, flying drones and other aircraft out of airbases in the country more than a month after a coup halted activities, the head of Air Forces in Europe and Air Forces Africa said. Since the July coup that removed President Mohamed Bazoum, the approximately 1,100 US soldiers deployed in the West African country have been confined to their military bases. General James Hecker said on Wednesday that negotiations with the military rulers of Niger resulted in some intelligence and surveillance missions resuming. "For a while, we weren't doing any missions on the bases, they pretty much closed down the airfields," Hecker told reporters at the annual Air and Space Forces Association convention. "Through the diplomatic process, we are now doing, I wouldn't say 100 percent of the missions that we were doing before, but we're doing a large amount of missions that we're doing before," he said.
Niger coup: Will the West change its security approach to the Sahel?
Until the recent military takeover in July 2023, Niger had played a key role in the security architecture of the West, particularly the United States and France, in the Sahel region. Niger hosts US and French military bases, while international support in different fields has increased exponentially in recent years. For example, take the 500 million euros ($546m) provided by the European Union in 2021, 120 million euros ($131m) of aid from France in 2022 or $150m of direct aid announced during US Secretary of State Antony Blinken's visit to Niamey in March 2023. This is one of the reasons why Niger had a relatively secure environment that did not allow violent armed activities to a large extent. Even though casualties from "terrorist attacks" increased worldwide after 2021, the loss of civilians in Niger decreased by 80 percent in 2022.
Here's How Small Farmers Across Africa Are Bringing Back Trees
A farmer in Niger tends to a tree sprout growing among his millet crop.Tony Rinaudo/World Vision Australia This story was originally published by Yale Environment 360 and is reproduced here as part of the Climate Desk collaboration. For decades, there have been reports of the deforestation in Africa. And they are true--the continent's forests are disappearing, lost mainly to expanding agriculture, logging, and charcoal-making. Maybe not, according to new satellite data analyzed by artificial intelligence and a growing body of on-the-ground studies. This new research is finding ever more trees outside forests, many of them nurtured by farmers and sprouting on their previously treeless fields.
NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track
Gow-Smith, Edward, Berard, Alexandre, Boito, Marcely Zanon, Calapodescu, Ioan
This paper presents NAVER LABS Europe's systems for Tamasheq-French and Quechua-Spanish speech translation in the IWSLT 2023 Low-Resource track. Our work attempts to maximize translation quality in low-resource settings using multilingual parameter-efficient solutions that leverage strong pre-trained models. Our primary submission for Tamasheq outperforms the previous state of the art by 7.5 BLEU points on the IWSLT 2022 test set, and achieves 23.6 BLEU on this year's test set, outperforming the second best participant by 7.7 points. For Quechua, we also rank first and achieve 17.7 BLEU, despite having only two hours of translation data. Finally, we show that our proposed multilingual architecture is also competitive for high-resource languages, outperforming the best unconstrained submission to the IWSLT 2021 Multilingual track, despite using much less training data and compute.
Dynamic Collaborative Multi-Agent Reinforcement Learning Communication for Autonomous Drone Reforestation
We approach autonomous drone-based reforestation with a collaborative multi-agent reinforcement learning (MARL) setup. Agents can communicate as part of a dynamically changing network. We explore collaboration and communication on the back of a high-impact problem. Forests are the main resource to control rising CO2 conditions. Unfortunately, the global forest volume is decreasing at an unprecedented rate. Many areas are too large and hard to traverse to plant new trees. To efficiently cover as much area as possible, here we propose a Graph Neural Network (GNN) based communication mechanism that enables collaboration. Agents can share location information on areas needing reforestation, which increases viewed area and planted tree count. We compare our proposed communication mechanism with a multi-agent baseline without the ability to communicate. Results show how communication enables collaboration and increases collective performance, planting precision and the risk-taking propensity of individual agents.
Forty fighters 'neutralised' in drone strikes in Niger
French drone strikes have killed nearly 40 fighters earlier travelling on motorcycles near Niger's border with Burkina Faso, France's military said on Thursday. In a statement, the French military called the strikes a "new tactical success" for France's counterterrorism efforts in Africa's Sahel region, named Operation Barkhane. "Intelligence obtained from Nigerien units in contact with the column confirmed that the motorcycles belonged to an armed terrorist group moving between Burkina Faso and Niger," Barkhane said in the statement. "In close coordination with Niger's Armed Forces, the Barkhane force conducted several strikes against the column. Nearly 40 terrorists were neutralised."