Goto

Collaborating Authors

 weather


Can cloud seeding save us from water bankruptcy?

New Scientist

Can cloud seeding save us from water bankruptcy? We've long tried to control the weather by engineering rainfall. Now such cloud-seeding efforts are escalating, creating conflict between countries and stoking conspiracy theories. On a cold, windy night in November 2025, a quadcopter drone took off from a farm field at the foot of the Bannock mountain range north of Salt Lake City, rising 4000 metres into thick clouds. A fan with anti-icing propellers kicked into action, blowing yellow dust out of a cannister attached to the back of the drone. Cloud-seeding company Rainmaker was trying to fight dust with dust, spreading silver iodide powder to encourage precipitation and end the deadly dust storms plaguing Utah's capital.


Temporal Functional Circuits: From Spline Plots to Faithful Explanations in KAN Forecasting

arXiv.org Machine Learning

Unlike MLPs, Kolmogorov-Arnold Networks (KANs) expose explicit learnable edge functions on every connection, enabling mechanistic explanation in time-series forecasting. This paper introduces Temporal Functional Circuits, a framework that transforms KAN edge functions from latent visualizations into faithful, temporally grounded explanations. Built on a gated residual KAN that decomposes forecasts into a linear base and a sparsely activated KAN correction, the framework (i) maps each edge to input lags via output-aware attribution, (ii) ranks edges by learned activation range, and (iii) validates faithfulness through edge-level interventions including zeroing and spline removal. Removing the learned B-spline component while retaining the base SiLU term degrades forecasts, providing evidence that the spline shape itself carries predictive value beyond the base activation. On four synthetic regimes of increasing complexity, the learned gate opens progressively wider as signal complexity grows. On regime-switching signals, gated KAN achieves 59% lower MSE than linear-only models. Across eight benchmarks, the gated architecture is competitive with linear, attention, and MLP alternatives, while providing interpretable edge functions that MLP-based corrections cannot offer.


DecompKAN: Decomposed Patch-KAN for Long-Term Time Series Forecasting

arXiv.org Machine Learning

Accurate time series forecasting in scientific domains such as climate modeling, physiological monitoring, and energy systems benefits from both competitive predictions and model transparency: practitioners value understanding how a model transforms temporal features, not merely what it predicts. Transformer-based models achieve strong accuracy but their attention weights reveal only token-level relevance, not the functional transformations applied to each feature. This work proposes DECOMPKAN, a lightweight attention-free architecture that combines trend-residual decomposition, channel-wise patching, learned instance normalization, and B-spline Kolmogorov-Arnold Network (KAN) edge functions. Each KAN edge learns an explicit, inspectable 1D scalar function ϕ(x) over learned patch-embedding coordinates that can be directly visualized, offering a form of architectural transparency not directly available in attention-based or MLP-based architectures. On standard benchmarks, DECOMPKAN achieves best or tied-best MSE on 15 of 32 dataset-horizon combinations among selected published baselines, and achieves best or tied-best MSE on 20 of 36 comparisons (25 of 36 MAE; ties counted for all tied models) under a controlled same-recipe evaluation across 9 datasets including the physiological PPG-DaLiA benchmark. The architecture shows particular strength on datasets with smooth temporal dynamics (Solar 17%, ECL 10%vs.


Honesty Is the Best Policy: Defining and Mitigating AIDeception

Neural Information Processing Systems

Deceptive agents are a challenge for the safety, trustworthiness, and cooperation of AI systems. We focus on the problem that agents might deceive in order to achieve their goals (for instance, in our experiments with language models, the goal of being evaluated as truthful). There are a number of existing definitions of deception in the literature on game theory and symbolic AI, but there is no overarching theory of deception for learning agents in games. We introduce a formal definition of deception in structural causal games, grounded in the philosophy literature, and applicable to real-world machine learning systems. Several examples and results illustrate that our formal definition aligns with the philosophical and commonsense meaning of deception. Our main technical result is to provide graphical criteria for deception. We show, experimentally, that these results can be used to mitigate deception in reinforcement learning agents and language models.


Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving

Neural Information Processing Systems

In an era marked by the rapid scaling of foundation models, autonomous driving technologies are approaching a transformative threshold where end-to-end autonomous driving (E2E-AD) emerges due to its potential of scaling up in the data-driven manner. However, existing E2E-AD methods are mostly evaluated under the open-loop log-replay manner with L2 errors and collision rate as metrics (e.g., in nuScenes), which could not fully reflect the driving performance of algorithms as recently acknowledged in the community. For those E2E-AD methods evaluated under the closed-loop protocol, they are tested in fixed routes (e.g., Town05Long and Longest6 in CARLA) with the driving score as metrics, which is known for high variance due to the unsmoothed metric function and large randomness in the long route. Besides, these methods usually collect their own data for training, which makes algorithm-level fair comparison infeasible. To fulfill the paramount need of comprehensive, realistic, and fair testing environments for Full Self-Driving (FSD), we present Bench2Drive, the first benchmark for evaluating E2E-AD systems' multiple abilities in a closed-loop manner. Bench2Drive's official training data consists of 2 million fully annotated frames, collected from 10000 short clips uniformly distributed under 44 interactive scenarios (cut-in, overtaking, detour, etc), 23 weathers (sunny, foggy, rainy, etc), and 12 towns (urban, village, university, etc) in CARLA v2. Its evaluation protocol requires E2E-AD models to pass 44 interactive scenarios under different locations and weathers which sums up to 220 routes and thus provides a comprehensive and disentangled assessment about their driving capability under different situations. We implement state-of-the-art E2E-AD models and evaluate them in Bench2Drive, providing insights regarding current status and future directions.


Supplementary Material for CrossGNN: Confronting Noisy Multivariate Time Series Via Cross Interaction Refinement Anonymous Author(s) Affiliation Address email Appendix 1

Neural Information Processing Systems

Correlation mechanism to capture cross-time dependency for forecasting. Besides, the dimension of the channel is set to 16 based on efficiency considerations. Weather, and the look-back window size is set as 96. Proposition 2. The time and space complexity for the Cross-variable GNN is Frequency enhanced decomposed transformer for long-term series forecasting.



execution of SEVIR required several novel ideas and insights, including recognition of a gap in ML-ready weather

Neural Information Processing Systems

Thank you to each reviewer for your helpful feedback on our paper. Below we provide our reasoning for several selected points. Due to page limits, only a portion of the updated figure is shown below. TrajGRU) would be out of scope (and well over page count). The baselines we provide show that depending on your choice of loss function, certain axes of "goodness" are brought We will add more discussion along these lines which address "what is done and why".



UK to get brief respite from rain, forecasts show

BBC News

You would be forgiven for thinking the rain this year has been relentless - because in some parts of the UK, it actually has been. Here at BBC Weather we have been watching computer models closely for signs of when that pattern will change. These computer-generated forecasts go out about two weeks into the future - and models have often been hinting at a change to colder and drier weather on that timescale. However, they have then reverted to the familiar wet pattern as we have got closer to the time. Now though, there are stronger signals of a change for some of us - albeit perhaps only a temporary one.