Niamey
Are drones, AI making it harder to fight armed groups in the Sahel?
Are drones, AI making it harder to fight armed groups in the Sahel? The brazen attack on the international airport and nearby military airbase in Niamey, Niger's capital, came overnight between January 28 and 29. Balls of orange fire flew across the sky as the Nigerien army attempted to respond while residents ducked for cover and whispered prayers, as shown in videos on social media. ISIL (ISIS) in Sahel Province, or ISSP - a Niger-based outfit earlier known as the ISIL affiliate in the Greater Sahara or ISGS - has since claimed responsibility and says it killed several soldiers, although the Nigerien army disputes this. Many of its fighters had breached military drone hangars using RPGs and mortars, and managed to damage several aircraft and one civilian aeroplane, according to videos from the group.
US to announce 'substantial' Russia sanctions
US to announce'substantial' Russia sanctions The US government will impose a substantial pickup in sanctions against Russia as the fighting in Ukraine continues, according to US Treasury Secretary Scott Bessent. Bessent's comments came just before Nato Secretary-General Mark Rutte was due at the White House, in which he said he hopes to discuss how to deliver Trump's vision of peace in the conflict. Earlier in the day, Rutte said he believes that Trump is the only one who can get this done. At least seven people were killed, including two children, during intense Russian drone and missile strikes on Ukraine - just hours after Trump said plans for a meeting with Vladimir Putin in Budapest had been shelved. Bessent provided no further details on the incoming sanctions, but said they would be announced either after the close this afternoon or first thing tomorrow morning.
EAP-GP: Mitigating Saturation Effect in Gradient-based Automated Circuit Identification
Zhang, Lin, Dong, Wenshuo, Zhang, Zhuoran, Yang, Shu, Hu, Lijie, Liu, Ninghao, Zhou, Pan, Wang, Di
Understanding the internal mechanisms of transformer-based language models remains challenging. Mechanistic interpretability based on circuit discovery aims to reverse engineer neural networks by analyzing their internal processes at the level of computational subgraphs. In this paper, we revisit existing gradient-based circuit identification methods and find that their performance is either affected by the zero-gradient problem or saturation effects, where edge attribution scores become insensitive to input changes, resulting in noisy and unreliable attribution evaluations for circuit components. To address the saturation effect, we propose Edge Attribution Patching with GradPath (EAP-GP), EAP-GP introduces an integration path, starting from the input and adaptively following the direction of the difference between the gradients of corrupted and clean inputs to avoid the saturated region. This approach enhances attribution reliability and improves the faithfulness of circuit identification. We evaluate EAP-GP on 6 datasets using GPT-2 Small, GPT-2 Medium, and GPT-2 XL. Experimental results demonstrate that EAP-GP outperforms existing methods in circuit faithfulness, achieving improvements up to 17.7%. Comparisons with manually annotated ground-truth circuits demonstrate that EAP-GP achieves precision and recall comparable to or better than previous approaches, highlighting its effectiveness in identifying accurate circuits.
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Keita, Mamadou K., Homan, Christopher, Hamani, Sofiane Abdoulaye, Bremang, Adwoa, Zampieri, Marcos, Alfari, Habibatou Abdoulaye, Ibrahim, Elysabhete Amadou, Owusu, Dennis
Grammatical error correction (GEC) is important for improving written materials for low-resource languages like Zarma -- spoken by over 5 million people in West Africa. Yet it remains a challenging problem. This study compares rule-based methods, machine translation (MT) models, and large language models (LLMs) for GEC in Zarma. We evaluate each approach's effectiveness on our manually-built dataset of over 250,000 examples using synthetic and human-annotated data. Our experiments show that the MT-based approach using the M2M100 model outperforms others, achieving a detection rate of 95.82% and a suggestion accuracy of 78.90% in automatic evaluations, and scoring 3.0 out of 5.0 in logical/grammar error correction during MEs by native speakers. The rule-based method achieved perfect detection (100%) and high suggestion accuracy (96.27%) for spelling corrections but struggled with context-level errors. LLMs like MT5-small showed moderate performance with a detection rate of 90.62% and a suggestion accuracy of 57.15%. Our work highlights the potential of MT models to enhance GEC in low-resource languages, paving the way for more inclusive NLP tools.
IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRI
Jeggle, Kai, Czerkawski, Mikolaj, Serva, Federico, Saux, Bertrand Le, Neubauer, David, Lohmann, Ulrike
IceCloudNet is a novel method based on machine learning able to predict high-quality vertically resolved cloud ice water contents (IWC) and ice crystal number concentrations (N$_\textrm{ice}$). The predictions come at the spatio-temporal coverage and resolution of geostationary satellite observations (SEVIRI) and the vertical resolution of active satellite retrievals (DARDAR). IceCloudNet consists of a ConvNeXt-based U-Net and a 3D PatchGAN discriminator model and is trained by predicting DARDAR profiles from co-located SEVIRI images. Despite the sparse availability of DARDAR data due to its narrow overpass, IceCloudNet is able to predict cloud occurrence, spatial structure, and microphysical properties with high precision. The model has been applied to ten years of SEVIRI data, producing a dataset of vertically resolved IWC and N$_\textrm{ice}$ of clouds containing ice with a 3 kmx3 kmx240 mx15 minute resolution in a spatial domain of 30{\deg}W to 30{\deg}E and 30{\deg}S to 30{\deg}N. The produced dataset increases the availability of vertical cloud profiles, for the period when DARDAR is available, by more than six orders of magnitude and moreover, IceCloudNet is able to produce vertical cloud profiles beyond the lifetime of the recently ended satellite missions underlying DARDAR.
Can the US find new partners in West Africa after Niger exit?
Following 11 years of defence cooperation and millions of dollars spent on maintaining military bases, the United States officially pulled its troops out of Niger this week in a surprise divorce that experts are calling a "blow" to Washington's ambitions for influence in the troubled Sahel region of West Africa. Once-close relations between the two countries saw the US establish large, expensive military bases from which it launched surveillance drones in Niger to monitor myriad armed groups linked to al-Qaeda and ISIL (ISIS). However, those ties collapsed in March when Niger's military government, which seized power in July 2023, cancelled a decade-long security agreement and told the US, which was pushing for a transition to civilian rule, to remove its 1,100 military personnel stationed there by September 15. For months, the US has failed to either fully align with or outright oppose the ruling military, analysts say. On the one hand, Washington seemed ready to maintain defence relations with the new ruling power, but on the other, it felt compelled to denounce the coup and pause aid to Niger.
Machine learning models for daily rainfall forecasting in Northern Tropical Africa using tropical wave predictors
Satheesh, Athul Rasheeda, Knippertz, Peter, Fink, Andreas H.
Numerical weather prediction (NWP) models often underperform compared to simpler climatology-based precipitation forecasts in northern tropical Africa, even after statistical postprocessing. AI-based forecasting models show promise but have avoided precipitation due to its complexity. Synoptic-scale forcings like African easterly waves and other tropical waves (TWs) are important for predictability in tropical Africa, yet their value for predicting daily rainfall remains unexplored. This study uses two machine-learning models--gamma regression and a convolutional neural network (CNN)--trained on TW predictors from satellite-based GPM IMERG data to predict daily rainfall during the July-September monsoon season. Predictor variables are derived from the local amplitude and phase information of seven TW from the target and up-and-downstream neighboring grids at 1-degree spatial resolution. The ML models are combined with Easy Uncertainty Quantification (EasyUQ) to generate calibrated probabilistic forecasts and are compared with three benchmarks: Extended Probabilistic Climatology (EPC15), ECMWF operational ensemble forecast (ENS), and a probabilistic forecast from the ENS control member using EasyUQ (CTRL EasyUQ). The study finds that downstream predictor variables offer the highest predictability, with downstream tropical depression (TD)-type wave-based predictors being most important. Other waves like mixed-Rossby gravity (MRG), Kelvin, and inertio-gravity waves also contribute significantly but show regional preferences. ENS forecasts exhibit poor skill due to miscalibration. CTRL EasyUQ shows improvement over ENS and marginal enhancement over EPC15. Both gamma regression and CNN forecasts significantly outperform benchmarks in tropical Africa. This study highlights the potential of ML models trained on TW-based predictors to improve daily precipitation forecasts in tropical Africa.
US hands last base in Niger to military junta
Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. The U.S. handed over its last military base in Niger -- one of two crucial hubs for American counterterrorism operations in the country -- to local authorities, the U.S. Department of Defense and Niger's Ministry of Defense announced in a joint statement on Monday. The handing over of Airbase 201 in the city of Agadez came after the U.S. troops withdrew earlier this month from Airbase 101, a small drone base in Niger's capital of Niamey. U.S. troops have until Sept. 15 to leave the Sahel country following an agreement with Nigerien authorities.
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation
Dong, Guanting, Zhu, Yutao, Zhang, Chenghao, Wang, Zechen, Dou, Zhicheng, Wen, Ji-Rong
Retrieval-augmented generation (RAG) has demonstrated effectiveness in mitigating the hallucination problem of large language models (LLMs). However, the difficulty of aligning the retriever with the diverse LLMs' knowledge preferences inevitably poses an inevitable challenge in developing a reliable RAG system. To address this issue, we propose DPA-RAG, a universal framework designed to align diverse knowledge preferences within RAG systems. Specifically, we initially introduce a preference knowledge construction pipline and incorporate five novel query augmentation strategies to alleviate preference data scarcity. Based on preference data, DPA-RAG accomplishes both external and internal preference alignment: 1) It jointly integrate pair-wise, point-wise, and contrastive preference alignment abilities into the reranker, achieving external preference alignment among RAG components. 2) It further introduces a pre-aligned stage before vanilla Supervised Fine-tuning (SFT), enabling LLMs to implicitly capture knowledge aligned with their reasoning preferences, achieving LLMs' internal alignment. Experimental results across four knowledge-intensive QA datasets demonstrate that DPA-RAG outperforms all baselines and seamlessly integrates both black-box and open-sourced LLM readers. Further qualitative analysis and discussions also provide empirical guidance for achieving reliable RAG systems. Our code is publicly available at https://github.com/dongguanting/DPA-RAG.
Submeter-level Land Cover Mapping of Japan
Yokoya, Naoto, Xia, Junshi, Broni-Bediako, Clifford
Deep learning has shown promising performance in submeter-level mapping tasks; however, the annotation cost of submeter-level imagery remains a challenge, especially when applied on a large scale. In this paper, we present the first submeter-level land cover mapping of Japan with eight classes, at a relatively low annotation cost. We introduce a human-in-the-loop deep learning framework leveraging OpenEarthMap, a recently introduced benchmark dataset for global submeter-level land cover mapping, with a U-Net model that achieves national-scale mapping with a small amount of additional labeled data. By adding a small amount of labeled data of areas or regions where a U-Net model trained on OpenEarthMap clearly failed and retraining the model, an overall accuracy of 80\% was achieved, which is a nearly 16 percentage point improvement after retraining. Using aerial imagery provided by the Geospatial Information Authority of Japan, we create land cover classification maps of eight classes for the entire country of Japan. Our framework, with its low annotation cost and high-accuracy mapping results, demonstrates the potential to contribute to the automatic updating of national-scale land cover mapping using submeter-level optical remote sensing data. The mapping results will be made publicly available.