Energy
Ukraine targets key Russian oil refinery as Moscow tests hypersonic missile
How is Russia replenishing its military? What is a'coalition of the willing'? How China forgot promises and'debts' to Ukraine How are Europe, the US pulling apart on Ukraine? Russia and Ukraine have been engaging in major aerial battles, targeting energy and transportation infrastructure, as Moscow presses its fierce ground assault in the Ukrainian east in the war's fourth year and tests a type of hypersonic weapon. Russia's Ministry of Defence announced on Sunday that its air defences shot down 361 drones, four guided aerial bombs, and rockets from a US-made high mobility artillery rocket system (HIMARS) overnight.
Yellowstone employees recover over 300 hats from hydrothermal areas
Be sure to hold on to your hats (and pizza) when near a boiling hot vent. Breakthroughs, discoveries, and DIY tips sent every weekday. No, it's your hat, ripped off your head by a gust of wind, spiraling off into the unknown. It's happened to the best of us. The only thing left to do is purchase another one before your face gets sunburnt .
Big Businesses Are Doing Carbon Dioxide Removal All Wrong
The technology is needed to limit global warming. But corporations are supporting it in lieu of reducing emissions. Achieving net-zero greenhouse gas emissions by 2050 will require removing carbon dioxide from the atmosphere, according to the Intergovernmental Panel on Climate Change, the world's foremost authority on the topic. But only some types of carbon removal are actually effective--and these are largely not the kind that major companies are investing in. A new report from the NewClimate Institute, a European think tank, finds that 35 of the world's biggest businesses are leaning on short-term tree-planting and other forms of "nondurable" carbon removal in order to say they've neutralized some of their climate pollution.
Ukraine strikes key Russian oil terminal in massive drone attack
Ukraine has struck Russia's largest oil terminal on the Baltic Sea during one of its biggest overnight drone attacks in months. The aerial assault targeted the Primorsk oil port in the Leningrad region, the final station of the Baltic Pipeline System and a crucial hub for Russia's maritime exports, Ukraine's security services told multiple outlets. More than half of the 221 drones sent to Russian territory were intercepted over the Bryansk and Smolensk regions, where Lukoil facilities were also reportedly targeted, the Russian defence ministry said. Meanwhile, officials said two civilians were killed in Ukraine's Sumy region when a Russian glide bomb struck a village near the border. Authorities in the Leningrad region said 28 drones were brought down and that a fire had broken out at a vessel and a pumping station in Primorsk.
LLM-Driven Adaptive 6G-Ready Wireless Body Area Networks: Survey and Framework
Torkamani, Mohammad Jalili, Mahmoudi, Negin, Kiashemshaki, Kiana
--Wireless Body Area Networks (WBANs) enable continuous monitoring of physiological signals for applications ranging from chronic disease management to emergency response. Recent advances in 6G communications, post-quantum cryptography, and energy harvesting have the potential to enhance WBAN performance. However, integrating these technologies into a unified, adaptive system remains a challenge. We propose a novel Large Language Model-driven adaptive WBAN framework in which a Large Language Model acts as a cognitive control plane, coordinating routing, physical layer selection, micro-energy harvesting, and post-quantum security in real time. Our review highlights the limitations of current heuristic-based designs and outlines a research agenda for resource-constrained, 6G-ready medical systems. This approach aims to enable ultra-reliable, secure, and self-optimizing WBANs for next-generation mobile health applications.
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Nadig, Rakesh, Arulchelvan, Vamanan, Bera, Rahul, Shahroodi, Taha, Singh, Gagandeep, Kakolyris, Andreas, Sadrosadati, Mohammad, Park, Jisung, Mutlu, Onur
Hybrid storage systems (HSS) integrate multiple storage devices with diverse characteristics to deliver high performance and capacity at low cost. The performance of an HSS highly depends on the effectiveness of two key policies: (1) the data-placement policy, which determines the best-fit storage device for incoming data, and (2) the data-migration policy, which dynamically rearranges stored data (i.e., prefetches hot data and evicts cold data) across the devices to sustain high HSS performance. Prior works optimize either data placement or data migration in isolation, which leads to suboptimal HSS performance. Unfortunately, no prior work tries to optimize both policies together. Our goal is to design a holistic data-management technique that optimizes both data-placement and data-migration policies to fully exploit the potential of an HSS, and thus significantly improve system performance. We propose Harmonia, a multi-agent reinforcement learning (RL)-based data-management technique that employs two lightweight autonomous RL agents, a data-placement agent and a data-migration agent, that adapt their policies for the current workload and HSS configuration while coordinating with each other to improve overall HSS performance. We evaluate Harmonia on real HSS configurations with up to four heterogeneous storage devices and seventeen data-intensive workloads. On performance-optimized (cost-optimized) HSS with two storage devices, Harmonia outperforms the best-performing prior approach by 49.5% (31.7%) on average. On an HSS with three (four) devices, Harmonia outperforms the best-performing prior work by 37.0% (42.0%) on average. Harmonia's performance benefits come with low latency (240ns for inference) and storage overheads (206 KiB in DRAM for both RL agents combined). We will open-source Harmonia's implementation to aid future research on HSS.
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
Dai, Runpeng, Song, Linfeng, Liu, Haolin, Liang, Zhenwen, Yu, Dian, Mi, Haitao, Tu, Zhaopeng, Liu, Rui, Zheng, Tong, Zhu, Hongtu, Yu, Dong
Reinforcement Learning with Verifiable Rewards (RLVR) is a powerful paradigm for enhancing the reasoning ability of Large Language Models (LLMs). Yet current RLVR methods often explore poorly, leading to premature convergence and entropy collapse. To address this challenge, we introduce Curiosity-Driven Exploration (CDE), a framework that leverages the model's own intrinsic sense of curiosity to guide exploration. We formalize curiosity with signals from both the actor and the critic: for the actor, we use perplexity over its generated response, and for the critic, we use the variance of value estimates from a multi-head architecture. Both signals serve as an exploration bonus within the RLVR framework to guide the model. Our theoretical analysis shows that the actor-wise bonus inherently penalizes overconfident errors and promotes diversity among correct responses; moreover, we connect the critic-wise bonus to the well-established count-based exploration bonus in RL. Empirically, our method achieves an approximate +3 point improvement over standard RLVR using GRPO/PPO on AIME benchmarks. Further analysis identifies a calibration collapse mechanism within RLVR, shedding light on common LLM failure modes.
Conditioning on PDE Parameters to Generalise Deep Learning Emulation of Stochastic and Chaotic Dynamics
Shokar, Ira J. S., Kerswell, Rich R., Haynes, Peter H.
We present a deep learning emulator for stochastic and chaotic spatio-temporal systems, explicitly conditioned on the parameter values of the underlying partial differential equations (PDEs). Our approach involves pre-training the model on a single parameter domain, followed by fine-tuning on a smaller, yet diverse dataset, enabling generalisation across a broad range of parameter values. By incorporating local attention mechanisms, the network is capable of handling varying domain sizes and resolutions. This enables computationally efficient pre-training on smaller domains while requiring only a small additional dataset to learn how to generalise to larger domain sizes. We demonstrate the model's capabilities on the chaotic Kuramoto-Sivashinsky equation and stochastically-forced beta-plane turbulence, showcasing its ability to capture phenomena at interpolated parameter values. The emulator provides significant computational speed-ups over conventional numerical integration, facilitating efficient exploration of parameter space, while a probabilistic variant of the emulator provides uncertainty quantification, allowing for the statistical study of rare events.
Human-in-the-loop Learning Through Decentralized Communication Mechanisms
Information sharing platforms like TripAdvisor and Waze involve human agents as both information producers and consumers. All these platforms operate in a centralized way to collect agents' latest observations of new options (e.g., restaurants, hotels, travel routes) and share such information with all in real time. However, after hearing the central platforms' live updates, many human agents are found selfish and unwilling to further explore unknown options for the benefit of others in the long run. To regulate the human-in-the-loop learning (HILL) game against selfish agents' free-riding, this paper proposes a paradigm shift from centralized to decentralized way of operation that forces agents' local explorations through restricting information sharing. When game theory meets distributed learning, we formulate our decentralized communication mechanism's design as a new multi-agent Markov decision process (MA-MDP), and derive its analytical condition to outperform today's centralized operation. As the optimal decentralized communication mechanism in MA-MDP is NP-hard to solve, we present an asymptotically optimal algorithm with linear complexity to determine the mechanism's timing of intermittent information sharing. Then we turn to non-myopic agents who may revert to even over-explore, and adapt our mechanism design to work. Simulation experiments using real-world dataset demonstrate the effectiveness of our decentralized mechanisms for various scenarios.
From scratch to silver: Creating trustworthy training data for patent-SDG classification using Large Language Models
Ascione, Grazia Sveva, Tamagnone, Nicolò
Classifying patents by their relevance to the UN Sustainable Development Goals (SDGs) is crucial for tracking how innovation addresses global challenges. However, the absence of a large, labeled dataset limits the use of supervised learning. Existing methods, such as keyword searches, transfer learning, and citation-based heuristics, lack scalability and generalizability. This paper frames patent-to-SDG classification as a weak supervision problem, using citations from patents to SDG-tagged scientific publications (NPL citations) as a noisy initial signal. To address its sparsity and noise, we develop a composite labeling function (LF) that uses large language models (LLMs) to extract structured concepts, namely functions, solutions, and applications, from patents and SDG papers based on a patent ontology. Cross-domain similarity scores are computed and combined using a rank-based retrieval approach. The LF is calibrated via a custom positive-only loss that aligns with known NPL-SDG links without penalizing discovery of new SDG associations. The result is a silver-standard, soft multi-label dataset mapping patents to SDGs, enabling the training of effective multi-label regression models. We validate our approach through two complementary strategies: (1) internal validation against held-out NPL-based labels, where our method outperforms several baselines including transformer-based models, and zero-shot LLM; and (2) external validation using network modularity in patent citation, co-inventor, and co-applicant graphs, where our labels reveal greater thematic, cognitive, and organizational coherence than traditional technological classifications. These results show that weak supervision and semantic alignment can enhance SDG classification at scale.