AITopics

We study online prediction under distribution shift, where inputs arrive chronologically and outcomes are revealed only after prediction. In this setting, predictors must remain stable in quiet regimes yet adapt when regimes shift, and the right adaptation memory is unknown in advance. We propose MELO (Memory-hedged Exponentially Weighted Least-Squares Online aggregation), a model-agnostic method that hedges across adaptation scales: it wraps any non-anticipating base-predictor pool with exponentially weighted least-squares (EWLS) adaptation experts at multiple forgetting factors, and aggregates raw and EWLS-adapted forecasts with MLpol which is a parameter-free online aggregation rule. Under boundedness conditions, we establish deterministic oracle inequalities showing that it competes with both the best raw predictor and the best bounded, time-varying affine combinations of the base predictions, up to a path-length-dependent tracking cost and a sublinear aggregation overhead. We evaluate MELO on French national electricity-load forecasting through the COVID-19 lockdown using no regime indicators, lockdown dates, or policy covariates. MELO reduces overall RMSE by 34.7%relative to base-only MLpol and achieves lower overall RMSE than a TabICL reference supplied with an external COVID policy-response covariate. MELO requires only lightweight per-step recursive updates without model retraining.

data mining, machine learning, prediction, (20 more...)

2605.06541

Country: Europe (0.28)

Genre: Research Report (1.00)

Industry: Energy > Power Industry (0.88)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(2 more...)

Nar, Bengusu, Li, Jiguang, Ročková, Veronika, Toulis, Panos

Dynamic Treatment on Networks

In networks, effective dynamic treatment allocation requires deciding both whom to treat and also when, so as to amplify policy impact through spillovers. An early intervention at a well-connected node can trigger cascades that change which nodes are worth targeting in the next period. Existing treatment strategies under network interference are largely static while dynamic treatment frameworks typically ignore network structure altogether. We integrate these perspectives and propose Q-Ising, a three-stage pipeline that (i) estimates network adoption dynamics via a Bayesian dynamic Ising model from a single observed panel, (ii) augments treatment adoption histories with continuous posterior latent states, and (iii) learns a dynamic policy via offline reinforcement learning. The Bayesian mechanism enables uncertainty quantification over dynamic decisions, yielding posterior ensemble policies with interpretable spillover estimates. We provide a finite-sample regret upper bound that decomposes into standard offline-RL uncertainty, network abstraction error, and first stage error in Ising state estimation. We apply our method to data from Indian village microfinance networks and synthetic stochastic block models under simulated heterogeneous susceptible-infected-susceptible (SIS) dynamics and demonstrate that adaptive targeting outperforms static centrality benchmarks.

machine learning, node, reinforcement learning, (18 more...)

2605.06564

Country: North America > United States (0.46)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.46)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

A Geometry-Aware Residual Correction of Hagan's SABR Implied Volatility Formula

Reghai, Adil, Tarsissi, Lama, Biau, Gérard, Lipton, Alex

This paper proposes a hybrid methodology to improve the approximation of SABR (Stochastic Alpha Beta Rho) implied volatility by combining analytical structure with machine learning. The approach augments the neural-network input representation with geometric features derived from the stochastic differential equations of the SABR model. Unlike approaches that fully replace analytical formulas with black-box models, the proposed framework preserves the analytical backbone of the model. The hybridization operates along two complementary dimensions. First, geometry-aware variables reflecting intrinsic properties of the SABR dynamics are used as structured inputs to the network. Second, the neural network is trained to learn the residual error relative to Hagan's closed-form approximation rather than implied volatility directly. The resulting model acts as a structured residual correction to the analytical formula, retaining interpretability while capturing higher-order effects that are not included in the asymptotic expansion. Numerical experiments conducted over realistic parameter domains, as well as stressed environments, show that the method improves accuracy and robustness compared with both analytical approximations and standard neural-network approaches. Because the correction remains lightweight and structurally consistent with the underlying model, the framework is well suited for real-time pricing and calibration in practical trading environments.

artificial intelligence, machine learning, volatility, (18 more...)

2605.06604

Country: Europe (0.46)

Genre: Research Report (0.82)

Industry: Banking & Finance > Trading (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Husar, Kateryna, Volfovsky, Alexander

DARTS: Targeting Prognostic Covariates in Budget-Constrained Sequential Experiments

Randomized controlled trials typically assume that prognostic covariates are known and available at no cost. In practice, obtaining high-dimensional pretreatment data is costly, forcing a trade-off between covariate-adaptive precision and a measurement budget. We introduce Dynamic Adaptive Rerandomization via Thompson Sampling (DARTS), which treats covariate acquisition as a sequential optimization problem embedded within a design-based causal inference task. A budgeted combinatorial Thompson sampler learns which covariates are most prognostic across successive batches; selected covariates then drive rerandomization and regression adjustment to reduce batch-level average treatment effect variance. Our primary theoretical contribution is a decoupling result: adaptive covariate selection based on past batches preserves batch-level randomization validity, and the cumulative inverse-variance weighted estimator achieves at least nominal asymptotic coverage. We further derive a Bayes risk bound for the acquisition layer that matches the minimax lower bound up to logarithmic factors. Empirically, DARTS systematically concentrates the budget on informative features, significantly closing the efficiency gap to oracle designs while maintaining strict inferential validity.

artificial intelligence, covariate, machine learning, (18 more...)

2605.06608

Genre:

Research Report > Strength High (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Zhang, Chenyang, Cao, Yuan

Transformers Efficiently Perform In-Context Logistic Regression via Normalized Gradient Descent

One widely recognized interpretation for their empirical success is their ability to perform in-context learning (ICL): pretrained transformers are capable of performing previously unseen tasks based on demonstrations and examples in the prompt, without requiring any additional task-specific fine-tuning (Brown et al., 2020). A line of recent works interpret the in-context learning (ICL) capability of transformers from an algorithmic perspective, viewing transformers as models that can implicitly execute certain learning algorithms on the context examples. Specifically, Garg et al. (2022) proposes a theoretical framework for ICL in terms of learning a hypothesis class, and empirically shows that transformers can in-context learn the linear function class. Motivated by this empirical finding, several recent works attempt to theoretically study how transformers perform in-context learning on linear regression tasks. Aky urek et al. (2022); Von Oswald et al. (2023) construct multi-layer transformers with linear attention that can execute gradient descent on the an "in-context loss" defined on the context data, thereby enabling in-context learning of linear regression.

ak 1, artificial intelligence, machine learning, (18 more...)

2605.06609

Genre:

Research Report > New Finding (0.64)
Research Report > Experimental Study (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.70)

The Structural Origin of Attention Sink: Variance Discrepancy, Super Neurons, and Dimension Disparity

Li, Siquan, Jiang, Kaiqi, Sun, Jiacheng, Hu, Tianyang

Despite the prevalence of the attention sink phenomenon in Large Language Models (LLMs), where initial tokens disproportionately monopolize attention scores, its structural origins remain elusive. This work provides a \textit{mechanistic explanation} for this phenomenon. First, we trace its root to the value aggregation process inherent in self-attention, which induces a systematic variance discrepancy. We further demonstrate that this discrepancy is drastically amplified by the activation of super neurons within Feed-Forward Network (FFN) layers. Specifically, the channel-sparse down-projections trigger a dimension disparity of the first-token representation, necessitating the formation of attention sinks as a structural anchor. Then, we validate this causal chain through two controlled interventions: (i) isolating the aggregation effect via attention mask modifications and (ii) amplifying the variance of targeted token representations. Both interventions can replicate attention sinks at arbitrary positions. Our mechanistic understanding offers a foundation for the systematic control of sink formation. Finally, as a proof of concept, we propose \textit{head-wise RMSNorm}, an architectural modification that stabilizes value aggregation outputs during pre-training. Our experiments demonstrate that restoring statistical parity across positions significantly accelerates convergence.

large language model, machine learning, natural language, (16 more...)

2605.06611

Country: Asia (0.46)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Online Bayesian Calibration under Gradual and Abrupt System Changes

Xu, Yang, Park, Chiwoo

Bayesian model calibration is central to digital twins and computer experiments, as it aligns model outputs with field observations by estimating calibration parameters and correcting systematic model bias. Classical Bayesian calibration introduces latent parameters and a discrepancy function to model bias, but suffers from parameter--discrepancy confounding and is typically formulated as an offline procedure under a stationary data-generating assumption. These limitations are restrictive in modern digital twin applications, where systems evolve over time and may exhibit gradual drift and abrupt regime shifts. While data assimilation methods enable sequential updates, they generally do not explicitly model systematic bias and are less effective under abrupt changes. We propose Bayesian Recursive Projected Calibration (BRPC), an online Bayesian calibration framework for streaming data under simulator mismatch and nonstationarity. BRPC extends projected calibration to the online setting by separating a discrepancy-free particle update for calibration parameters from a conditional Gaussian process update for discrepancy, preserving identifiability while enabling bias-aware adaptation under gradual system evolution. To handle abrupt changes, BRPC is integrated with restart mechanisms that detect regime shifts and reset the calibration process. We establish theoretical guarantees for both components, including tracking performance under gradual evolution and false-alarm and detection behavior for restart mechanisms. Empirical studies on synthetic and plant-simulation benchmarks show that BRPC improves calibration accuracy under gradual changes, while restart-augmented BRPC further improves robustness and predictive performance under abrupt regime shifts compared to sliding-window Bayesian calibration and data assimilation baselines.

artificial intelligence, machine learning, restart, (19 more...)

2605.06612

Country: North America > United States (0.45)

Genre: Research Report (0.82)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Data Science (0.92)
(3 more...)

WIREDMay-7-2026, 21:37:20 GMT

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained

Today on, we're diving into recent reports that the Trump administration is considering an executive order that would establish some sort of federal oversight over new AI models. This week on, the team discusses the surprising reports of the Trump administration seemingly reversing its stance when it comes to AI safety and regulation. We also look into what exactly is going on with the Hantavirus outbreak, and whether you should be worried. Also, we get into the story of how a former federal employee who was ousted by Elon Musk's so-called Department of Government Efficiency is now running for office. Plus, a Spirit Airlines laid off employee shares with us how they experienced the company's shutdown news last weekend and what they'll miss most about the job. A Federal Worker Was Fired for Filming DOGE. Write to us at [email protected] . You can always listen to this week's podcast through the audio player on this page, but if you want to subscribe for free to get every episode, here's how: If you're on an iPhone or iPad, open the app called Podcasts, or just tap this link . And we're going to talk about whether this move actually signals a meaningful shift in future regulation of this technology.

artificial intelligence, brian barrett, leah feiger, (11 more...)

WIRED

Country: North America > United States > California (0.28)

Genre: Personal > Interview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.64)

Popular ScienceMay-7-2026, 20:01:00 GMT

Glowing algae could power the lamps of the future

The bioluminescent plants are a potential alternative to electrical light and batteries. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Acidic (top) and basic (bottom) environments trigger different bioluminescent behaviors in algae. Breakthroughs, discoveries, and DIY tips sent six days a week. Bioluminescence is everywhere in nature, but it puts on its biggest light shows underwater .

algae, artificial intelligence, physics popular science video space, (10 more...)

Popular Science

Country: North America > United States > Colorado (0.15)

Genre: Research Report > New Finding (0.36)

Industry:

Energy (0.30)
Consumer Products & Services > Personal Products (0.30)

Technology: Information Technology > Artificial Intelligence (0.52)

FOX NewsMay-7-2026, 19:26:06 GMT

Matthew Tkachuk continues to chase Team USA Hockey dominance as 2026 IIHF World Championship begins

President Trump on $1,000 World Cup ticket prices: 'I wouldn't pay it either, to be honest' Pirates vs. Diamondbacks betting preview targets the under as both offenses go cold in series Former LSU coach Brian Kelly uses AI to prepare for job interviews, proving he's just like the rest of us Newsom office source responds to planned protest against trans athlete at state playoff girls' track meet Framber Valdez gets what he deserves for punk move, suspended six games after drilling Boston's Trevor Story MLB's new automated strike zone has a hidden feature helping umpires become more accurate than ever'This can touch anyone': Gorman family speaks following loss of Sheridan'Project Freedom' could soon resume: Report Iranian people are not citizens, but'subjects' of the regime: Middle East expert Vice Admiral Robert Harward weighs in on restarting'Project Freedom' in Strait of Hormuz Largest teachers' union accused of antisemitism in federal civil rights complaint McEnany's URGENT plea: 'Be Spencer Pratt!' WHO doesn't expect large Hantavirus outbreak US blockade keeps stranglehold on Iran's economy The Panthers star told Pat McAfee the U.S. is heading to Switzerland to win, not for a vacation If anyone thought Team USA was satisfied with Olympic gold and ready to coast through the rest of the international hockey calendar, Matthew Tkachuk has a message. The Florida Panthers star joined The Pat McAfee Show on Thursday and discussed his plan to play for Team USA at the 2026 IIHF World Championship in Switzerland. USA Hockey's preliminary roster, announced May 7, includes Tkachuk for the first time, since the Panthers failed to reach the NHL playoffs this season. The tournament begins May 15 in Zurich and Fribourg, and the Americans are trying to win back-to-back gold medals at the event for the first time ever. Tkachuk made his mindset pretty clear.

artificial intelligence, social media, united states, (13 more...)

FOX News

Country:

North America > United States (1.00)
Asia > Middle East > Iran (0.54)
Europe > Switzerland > Zürich > Zürich (0.24)
Europe > Switzerland > Fribourg > Fribourg (0.24)

Genre: Personal > Honors (0.37)

Industry:

Leisure & Entertainment > Sports > Olympic Games (1.00)
Leisure & Entertainment > Sports > Hockey (1.00)
Leisure & Entertainment > Sports > Basketball (1.00)
Government > Regional Government > North America Government > United States Government (0.49)

Technology:

Information Technology > Communications > Social Media (0.71)
Information Technology > Artificial Intelligence (0.69)