Genre
Hedging Memory Horizons for Non-Stationary Prediction via Online Aggregation
Wang, Yutong, Goude, Yannig, Yao, Qiwei
We study online prediction under distribution shift, where inputs arrive chronologically and outcomes are revealed only after prediction. In this setting, predictors must remain stable in quiet regimes yet adapt when regimes shift, and the right adaptation memory is unknown in advance. We propose MELO (Memory-hedged Exponentially Weighted Least-Squares Online aggregation), a model-agnostic method that hedges across adaptation scales: it wraps any non-anticipating base-predictor pool with exponentially weighted least-squares (EWLS) adaptation experts at multiple forgetting factors, and aggregates raw and EWLS-adapted forecasts with MLpol which is a parameter-free online aggregation rule. Under boundedness conditions, we establish deterministic oracle inequalities showing that it competes with both the best raw predictor and the best bounded, time-varying affine combinations of the base predictions, up to a path-length-dependent tracking cost and a sublinear aggregation overhead. We evaluate MELO on French national electricity-load forecasting through the COVID-19 lockdown using no regime indicators, lockdown dates, or policy covariates. MELO reduces overall RMSE by 34.7%relative to base-only MLpol and achieves lower overall RMSE than a TabICL reference supplied with an external COVID policy-response covariate. MELO requires only lightweight per-step recursive updates without model retraining.
Dynamic Treatment on Networks
Nar, Bengusu, Li, Jiguang, Ročková, Veronika, Toulis, Panos
In networks, effective dynamic treatment allocation requires deciding both whom to treat and also when, so as to amplify policy impact through spillovers. An early intervention at a well-connected node can trigger cascades that change which nodes are worth targeting in the next period. Existing treatment strategies under network interference are largely static while dynamic treatment frameworks typically ignore network structure altogether. We integrate these perspectives and propose Q-Ising, a three-stage pipeline that (i) estimates network adoption dynamics via a Bayesian dynamic Ising model from a single observed panel, (ii) augments treatment adoption histories with continuous posterior latent states, and (iii) learns a dynamic policy via offline reinforcement learning. The Bayesian mechanism enables uncertainty quantification over dynamic decisions, yielding posterior ensemble policies with interpretable spillover estimates. We provide a finite-sample regret upper bound that decomposes into standard offline-RL uncertainty, network abstraction error, and first stage error in Ising state estimation. We apply our method to data from Indian village microfinance networks and synthetic stochastic block models under simulated heterogeneous susceptible-infected-susceptible (SIS) dynamics and demonstrate that adaptive targeting outperforms static centrality benchmarks.
A Geometry-Aware Residual Correction of Hagan's SABR Implied Volatility Formula
Reghai, Adil, Tarsissi, Lama, Biau, Gérard, Lipton, Alex
This paper proposes a hybrid methodology to improve the approximation of SABR (Stochastic Alpha Beta Rho) implied volatility by combining analytical structure with machine learning. The approach augments the neural-network input representation with geometric features derived from the stochastic differential equations of the SABR model. Unlike approaches that fully replace analytical formulas with black-box models, the proposed framework preserves the analytical backbone of the model. The hybridization operates along two complementary dimensions. First, geometry-aware variables reflecting intrinsic properties of the SABR dynamics are used as structured inputs to the network. Second, the neural network is trained to learn the residual error relative to Hagan's closed-form approximation rather than implied volatility directly. The resulting model acts as a structured residual correction to the analytical formula, retaining interpretability while capturing higher-order effects that are not included in the asymptotic expansion. Numerical experiments conducted over realistic parameter domains, as well as stressed environments, show that the method improves accuracy and robustness compared with both analytical approximations and standard neural-network approaches. Because the correction remains lightweight and structurally consistent with the underlying model, the framework is well suited for real-time pricing and calibration in practical trading environments.
DARTS: Targeting Prognostic Covariates in Budget-Constrained Sequential Experiments
Husar, Kateryna, Volfovsky, Alexander
Randomized controlled trials typically assume that prognostic covariates are known and available at no cost. In practice, obtaining high-dimensional pretreatment data is costly, forcing a trade-off between covariate-adaptive precision and a measurement budget. We introduce Dynamic Adaptive Rerandomization via Thompson Sampling (DARTS), which treats covariate acquisition as a sequential optimization problem embedded within a design-based causal inference task. A budgeted combinatorial Thompson sampler learns which covariates are most prognostic across successive batches; selected covariates then drive rerandomization and regression adjustment to reduce batch-level average treatment effect variance. Our primary theoretical contribution is a decoupling result: adaptive covariate selection based on past batches preserves batch-level randomization validity, and the cumulative inverse-variance weighted estimator achieves at least nominal asymptotic coverage. We further derive a Bayes risk bound for the acquisition layer that matches the minimax lower bound up to logarithmic factors. Empirically, DARTS systematically concentrates the budget on informative features, significantly closing the efficiency gap to oracle designs while maintaining strict inferential validity.
Transformers Efficiently Perform In-Context Logistic Regression via Normalized Gradient Descent
One widely recognized interpretation for their empirical success is their ability to perform in-context learning (ICL): pretrained transformers are capable of performing previously unseen tasks based on demonstrations and examples in the prompt, without requiring any additional task-specific fine-tuning (Brown et al., 2020). A line of recent works interpret the in-context learning (ICL) capability of transformers from an algorithmic perspective, viewing transformers as models that can implicitly execute certain learning algorithms on the context examples. Specifically, Garg et al. (2022) proposes a theoretical framework for ICL in terms of learning a hypothesis class, and empirically shows that transformers can in-context learn the linear function class. Motivated by this empirical finding, several recent works attempt to theoretically study how transformers perform in-context learning on linear regression tasks. Aky urek et al. (2022); Von Oswald et al. (2023) construct multi-layer transformers with linear attention that can execute gradient descent on the an "in-context loss" defined on the context data, thereby enabling in-context learning of linear regression.
The Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and Dimension Disparity
Li, Siquan, Jiang, Kaiqi, Sun, Jiacheng, Hu, Tianyang
Despite the prevalence of the attention sink phenomenon in Large Language Models (LLMs), where initial tokens disproportionately monopolize attention scores, its structural origins remain elusive. This work provides a \textit{mechanistic explanation} for this phenomenon. First, we trace its root to the value aggregation process inherent in self-attention, which induces a systematic variance discrepancy. We further demonstrate that this discrepancy is drastically amplified by the activation of super neurons within Feed-Forward Network (FFN) layers. Specifically, the channel-sparse down-projections trigger a dimension disparity of the first-token representation, necessitating the formation of attention sinks as a structural anchor. Then, we validate this causal chain through two controlled interventions: (i) isolating the aggregation effect via attention mask modifications and (ii) amplifying the variance of targeted token representations. Both interventions can replicate attention sinks at arbitrary positions. Our mechanistic understanding offers a foundation for the systematic control of sink formation. Finally, as a proof of concept, we propose \textit{head-wise RMSNorm}, an architectural modification that stabilizes value aggregation outputs during pre-training. Our experiments demonstrate that restoring statistical parity across positions significantly accelerates convergence.
Online Bayesian Calibration under Gradual and Abrupt System Changes
Bayesian model calibration is central to digital twins and computer experiments, as it aligns model outputs with field observations by estimating calibration parameters and correcting systematic model bias. Classical Bayesian calibration introduces latent parameters and a discrepancy function to model bias, but suffers from parameter--discrepancy confounding and is typically formulated as an offline procedure under a stationary data-generating assumption. These limitations are restrictive in modern digital twin applications, where systems evolve over time and may exhibit gradual drift and abrupt regime shifts. While data assimilation methods enable sequential updates, they generally do not explicitly model systematic bias and are less effective under abrupt changes. We propose Bayesian Recursive Projected Calibration (BRPC), an online Bayesian calibration framework for streaming data under simulator mismatch and nonstationarity. BRPC extends projected calibration to the online setting by separating a discrepancy-free particle update for calibration parameters from a conditional Gaussian process update for discrepancy, preserving identifiability while enabling bias-aware adaptation under gradual system evolution. To handle abrupt changes, BRPC is integrated with restart mechanisms that detect regime shifts and reset the calibration process. We establish theoretical guarantees for both components, including tracking performance under gradual evolution and false-alarm and detection behavior for restart mechanisms. Empirical studies on synthetic and plant-simulation benchmarks show that BRPC improves calibration accuracy under gradual changes, while restart-augmented BRPC further improves robustness and predictive performance under abrupt regime shifts compared to sliding-window Bayesian calibration and data assimilation baselines.
Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained
Today on, we're diving into recent reports that the Trump administration is considering an executive order that would establish some sort of federal oversight over new AI models. This week on, the team discusses the surprising reports of the Trump administration seemingly reversing its stance when it comes to AI safety and regulation. We also look into what exactly is going on with the Hantavirus outbreak, and whether you should be worried. Also, we get into the story of how a former federal employee who was ousted by Elon Musk's so-called Department of Government Efficiency is now running for office. Plus, a Spirit Airlines laid off employee shares with us how they experienced the company's shutdown news last weekend and what they'll miss most about the job. A Federal Worker Was Fired for Filming DOGE. Write to us at [email protected] . You can always listen to this week's podcast through the audio player on this page, but if you want to subscribe for free to get every episode, here's how: If you're on an iPhone or iPad, open the app called Podcasts, or just tap this link . And we're going to talk about whether this move actually signals a meaningful shift in future regulation of this technology.
Glowing algae could power the lamps of the future
The bioluminescent plants are a potential alternative to electrical light and batteries. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Acidic (top) and basic (bottom) environments trigger different bioluminescent behaviors in algae. Breakthroughs, discoveries, and DIY tips sent six days a week. Bioluminescence is everywhere in nature, but it puts on its biggest light shows underwater .
Matthew Tkachuk continues to chase Team USA Hockey dominance as 2026 IIHF World Championship begins
President Trump on $1,000 World Cup ticket prices: 'I wouldn't pay it either, to be honest' Pirates vs. Diamondbacks betting preview targets the under as both offenses go cold in series Former LSU coach Brian Kelly uses AI to prepare for job interviews, proving he's just like the rest of us Newsom office source responds to planned protest against trans athlete at state playoff girls' track meet Framber Valdez gets what he deserves for punk move, suspended six games after drilling Boston's Trevor Story MLB's new automated strike zone has a hidden feature helping umpires become more accurate than ever'This can touch anyone': Gorman family speaks following loss of Sheridan'Project Freedom' could soon resume: Report Iranian people are not citizens, but'subjects' of the regime: Middle East expert Vice Admiral Robert Harward weighs in on restarting'Project Freedom' in Strait of Hormuz Largest teachers' union accused of antisemitism in federal civil rights complaint McEnany's URGENT plea: 'Be Spencer Pratt!' WHO doesn't expect large Hantavirus outbreak US blockade keeps stranglehold on Iran's economy The Panthers star told Pat McAfee the U.S. is heading to Switzerland to win, not for a vacation If anyone thought Team USA was satisfied with Olympic gold and ready to coast through the rest of the international hockey calendar, Matthew Tkachuk has a message. The Florida Panthers star joined The Pat McAfee Show on Thursday and discussed his plan to play for Team USA at the 2026 IIHF World Championship in Switzerland. USA Hockey's preliminary roster, announced May 7, includes Tkachuk for the first time, since the Panthers failed to reach the NHL playoffs this season. The tournament begins May 15 in Zurich and Fribourg, and the Americans are trying to win back-to-back gold medals at the event for the first time ever. Tkachuk made his mindset pretty clear.