Goto

Collaborating Authors

 Energy


Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series

arXiv.org Machine Learning

Conformal prediction methods enjoy strong theoretical and empirical predictive inference performance, provided the data is exchangeable, and predictors are trained in a memoryless fashion. However, these assumptions and constraints are impractical in many real-data settings, such as time series (where temporal dependence violates exchangeability, and where memoryless predictors will inevitably have poor predictive accuracy). Recent work shows that the split conformal prediction method is robust to these issues of memory-based predictors and deviations from exchangeability that are common features of time-series data. However, since using sample splitting can lead to lower accuracy, this motivates asking whether other predictive inference methods (that do not rely on data splitting) could also be reliably used in the time series setting. In this work, we show that the vanilla leave-one-out jackknife can suffer an arbitrary loss of coverage even in canonical time series models with mild temporal dependence. As a remedy, we propose a careful modification tailored to such settings, which we term the \emph{leave-a-window-out} (LWO) method, and show that it can achieve valid coverage provided that the model-fitting procedure satisfies mild stability properties. Our proofs are based on quantifying the degree to which the data departs from \emph{cyclic exchangeability}, and we introduce new coefficients to measure the extent of this departure. Experiments on time series data demonstrate that our LWO method often enjoys valid coverage when the vanilla jackknife fails to cover, while producing much narrower intervals than split conformal prediction.


Kelsey Pfendler is trying to become the youngest woman to row solo from California to Hawaii

Popular Science

Pfendler has already faced blisters, brutal winds, and lost freshwater in the first week of her over 2,400 mile journey. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Kelsey Pfendler will share updates along the way via social media. Breakthroughs, discoveries, and DIY tips sent six days a week. By signing up, you confirm you are 16+, will receive newsletters and promotional content and agree to our Terms of Use and acknowledge the data practices in our Privacy Policy .


Oil prices fall after report of breakthrough in US-Iran talks

BBC News

Oil prices have dropped following a report the US and Iran have reached a deal, subject to President Donald Trump's approval. Axios reported officials had made an agreement over an extended ceasefire on Thursday. It drove the price of a barrel of Brent crude down to a low of $93.36 from a earlier high of $98, before rebounding to about $94. Prices had jumped earlier after the US carried out new attacks on Iran, targeting a military site in Bandar Abbas, a strategic port city. The strikes occurred despite an ongoing ceasefire between Tehran and Washington to allow for talks to end the three-month-long war that has effectively closed the Strait of Hormuz waterway, pushing up global energy costs.


Irish datacentres have increased household bills by hundreds of euros, report finds

The Guardian

Datacentre industry representatives disputed the findings and said the sector boosted the economy. Datacentre industry representatives disputed the findings and said the sector boosted the economy. 'Hidden datacentre tax' costing Irish households millions, report says Datacentres used 22% of country's electricity last year, pushing up household bills, study suggests Thu 28 May 2026 09.01 EDTLast modified on Thu 28 May 2026 09.32 EDT Energy demand by datacentres in Ireland has added hundreds of euros to household electricity bills in a pattern that could be replicated across Europe, according to a report. Ireland's growing number of datacentres last year used 22% of the country's electricity, more than all urban homes combined, according to the Central Statistics Office. The equivalent figure in the US and UK is 6%.


The Download: climate tech goes public and the AI Hype Index returns

MIT Technology Review

Plus: Illinois just passed what could become America's strongest AI safety law. Climate tech companies are going public. Solar and battery company Solv Energy went public in February, hitting a $6 billion valuation. X-energy, which builds small modular nuclear reactors, followed at $11.5 billion. Then came geothermal company Fervo Energy, reaching a market cap of about $12.4 billion. All three have been IPO success stories.


China's secret weapon in AI race with US? Lots of cheap energy

Al Jazeera

In the race against China for AI supremacy, the United States dominates when it comes to access to the most cutting-edge semiconductors. But when it comes to powering the huge data centres that run on AI chips, China holds the clear advantage. A typical data centre can consume as much electricity as 100,000 households, while next-generation "hyperscale" facilities can gobble up as much power as two million homes, according to the International Energy Agency (IEA). China's access to an abundant supply of cheap electricity places it in the ideal position to meet such colossal energy demands. China already generates more than twice as much electricity as the US, a lead that is expected to widen amid an aggressive state-led investment in the country's energy grid.


Stop Suppressing the Tail: Causal Inference for Extreme Events

arXiv.org Machine Learning

Estimating how an outcome responds to a continuous treatment (the Average Dose-Response Function, or ADRF) is a core causal-inference primitive. However, when outcomes possess heavy tails, standard robust double machine learning (DML) deliberately suppresses these extremes to stabilize the bulk average. In high-stakes settings, such as financial returns or climate losses, this omitted 1-in-1000 extreme event is the actual target quantity. Furthermore, current methods that read the tail from a model's residuals suffer from circular dependence, causing tail shape inferences to shift drastically based solely on whether the core estimator is switched between Huber and Welsch.The research proposes an ADRF estimator that emits a structured tail-shape output alongside the standard point estimate. Its tail diagnostic (PDHTE+JK) evaluates the per-treatment tail shape from the outcome centered by a pilot median, successfully breaking the circular dependence and rendering the diagnostic invariant to the choice of core method. The output encompasses four treatment-conditional quantities: tail shape $\hatฮพ(t)$, deep-tail return levels $\hat{Q}_ฮฑ(t)$, conditional shortfalls $\hat{S}_ฮฑ(t)$, the recovered mean ADRF, and an explicit refusal mechanism that declines extrapolation when extreme-value modeling is unsupported by the data. Compared to kernel-weighted quantile regression (QR), the proposed estimator reduces deep-tail ($ฮฑ=0.001$) return-level MAE by 11% and conditional-shortfall MAE by 25.5% across a heavy-tailed panel. It also achieves a 20-29% MAE reduction in sample-scarce regimes ($n\le2000$). On freMTPL2 motor-insurance claims, it successfully triggered an explicit extrapolation refusal on the log-claim scale, which neither QR nor loss-only DML can produce.


GenSBI: Generative Methods for Simulation-Based Inference in JAX

arXiv.org Machine Learning

Flow and diffusion generative models have established themselves as widely adopted density estimators for simulation-based inference (SBI), extending naturally from neural posterior estimation to likelihood and joint density estimation. Their principled optimization objectives and freedom from architectural constraints have driven rapid adoption across the natural sciences. Yet the most widely used SBI libraries remain PyTorch-based, leaving researchers who develop their forward models and analysis pipelines in JAX without a native option. We present GenSBI, an open-source library that implements flow matching, score matching, and denoising diffusion entirely in JAX. The library offers three transformer-based architectures -- SimFormer, Flux1, and a novel Flux1Joint that extends gate-modulated transformer blocks to joint density estimation -- all interchangeable through a unified interface that decouples generative method, neural backbone, and inference mode. GenSBI provides an end-to-end workflow from training through posterior calibration (SBC, TARP, LC2ST) and supports custom architectures with domain-specific embedding networks.


Deep Neural Network Training as Random Effects: An Optimization-Inference Duality

arXiv.org Machine Learning

Deep neural networks (DNNs) have achieved remarkable empirical success, yet their training dynamics remain understood mainly from optimization rather than statistical principles. Here we develop a statistical framework for DNN training in the over-parameterized regime by showing that the prediction induced by continuous-time neural tangent kernel (NTK) gradient flow is exactly equivalent to that from a classical random-effects model. In this framework, training time acts as a variance component, or equivalently an empirical Bayes covariance hyperparameter, governing the allocation of variation from noise to structured signal. This equivalence reveals an optimization-inference duality: the gradient-flow path is both an optimization trajectory and an empirical Bayes random-effects inference path. Conditional on training time, the network output is the posterior mean of the latent signal, and estimating training time by restricted maximum likelihood (REML) turns early stopping into likelihood-based empirical Bayes inference rather than external tuning. This perspective yields a two-stage inferential procedure. First, a variance-component test determines whether DNN training captures statistically significant structure beyond initialization. Second, conditional on training being warranted, REML provides a likelihood-based early stopping rule. The resulting stopping time admits a spectral interpretation in the NTK eigenbasis, where training proceeds until spectral loss decorrelation is achieved. We further establish that REML-guided early stopping achieves asymptotically optimal prediction error for fixed-design in-sample prediction and, under additional random-design regularity conditions, for out-of-sample prediction. This work reframes DNN training as statistical inference and provides a principled foundation for deciding whether and how long to train deep neural networks.


Adaptive Bandit Algorithms for Contextual Matching Markets

arXiv.org Machine Learning

We study bandit learning in matching markets, where players and arms constitute the two market sides, and the players' utilities are linear in the arm contexts. In each round, new arms arrive with observable contexts. Then, the algorithm matches them to players, aiming to minimize each player's regret against a stable matching benchmark. This contextual structure creates significant complexity: subtle context shifts can slightly alter one player's utility while completely reconfiguring the underlying benchmark, causing large regret spikes for others. We address this in two settings: stochastic contexts, drawn from a latent distribution, and adversarial contexts, which may be arbitrary. For the stochastic case, we introduce a novel minimum preference gap to capture learning difficulty and provide a fully adaptive algorithm with an instance-dependent poly-logarithmic regret upper bound. We also establish matching instance-independent regret upper and lower bounds under a mild distributional assumption. For the adversarial setting, we propose a tractable regret notion that remains valid under arbitrary contexts and achieves an instance-independent sublinear regret bound via an adaptive algorithm.