AITopics | Energy

Collaborating Authors

Energy

TabPFN-3: Technical Report

Grinsztajn, Léo, Flöge, Klemens, Key, Oscar, Birkel, Felix, Jund, Philipp, Roof, Brendan, Manium, Mihir, Bin, Shi, Hoo, null, Bühler, Magnus, Garg, Anurag, Safaric, Dominik, Robertson, Jake, Jäger, Benjamin, Alessi, Simone, Hayler, Adrian, Moroshan, Vladyslav, Purucker, Lennart, Singer, Philipp, Arazi, Alan, Siems, Julien, Metzen, Jan Hendrik, Grab, Georg, Erickson, Nick, Guo, Siyuan, Kalfon, Eliott, Bing, Simon, Salinas, David, Cornu, Clara, Wehrhahn, Lilly Charlotte, Kriuchkova, Diana, Kaya, Kursat, Sidhoum, Lydia, Salmon, Marie, Chen, Jerry, Hulsebos, Madelon, LeCun, Yann, Müller, Samuel, Schölkopf, Bernhard, Gambhir, Sauraj, Hollmann, Noah, Hutter, Frank

arXiv.org Machine LearningMay-15-2026

Tabular data underpins most high-value prediction problems in science and industry, and TabPFN has driven the foundation model revolution for this modality. Designed with feedback from our users, TabPFN-3 builds on this foundation to scale state-of-the-art performance to datasets with 1M training rows and substantially reduce training and inference time. Pretrained exclusively on synthetic data from our prior, TabPFN-3 dramatically pushes the frontier of tabular prediction and brings substantial gains on time series, relational, and tabular-text data. On the standard tabular benchmark TabArena, a forward pass of TabPFN-3 outperforms all other models, including tuned and ensembled baselines, by a significant margin, and pareto-dominates the speed/performance frontier. On more diverse datasets, TabPFN-3 ranks first on datasets with many classes, and beats 8-hour-tuned gradient-boosted-tree baselines on datasets up to 1M training rows and 200 features. TabPFN-3 introduces test-time compute scaling to tabular foundation models. Our API offering TabPFN-3-Plus (Thinking) exploits this to beat all non-TabPFN models by over 200 Elo on TabArena, rising to 420 Elo on the largest data subset, and outperforms AutoGluon 1.5 extreme while being 10x faster, without using LLMs, real data, internet search or any other model besides TabPFN. TabPFN-3 extends the capabilities of our models, enabling SOTA prediction on relational data (new SOTA foundation model on RelBenchV1) and tabular-text data (SOTA on TabSTAR via TabPFN-3-Plus); and improves existing integrations: a specialized checkpoint, TabPFN-TS-3, ranks 2nd on the time-series benchmark fev-bench, and SHAP-value computation is up to 120x faster. TabPFN-3 achieves this performance while being up to 20x faster than TabPFN-2.5. In addition, a reduced KV cache and row-chunking scale to 1M rows on one H100 with fast inference speed.

arXiv.org Machine Learning

2605.13986

Country:

Asia (1.00)
North America > United States (0.67)

Genre: Research Report > Experimental Study (1.00)

Industry:

Materials (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
(21 more...)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
(6 more...)

Add feedback

Policy Optimization in Hybrid Discrete-Continuous Action Spaces via Mixed Gradients

Alvo, Matias, Russo, Daniel, Kanoria, Yash

arXiv.org Machine LearningMay-15-2026

We study reinforcement learning in hybrid discrete-continuous action spaces, such as settings where the discrete component selects a regime (or index) and the continuous component optimizes within it -- a structure common in robotics, control, and operations problems. Standard model-free policy gradient methods rely on score-function (SF) estimators and suffer from severe credit-assignment issues in high-dimensional settings, leading to poor gradient quality. On the other hand, differentiable simulation largely sidesteps these issues by backpropagating through a simulator, but the presence of discrete actions or non-smooth dynamics yields biased or uninformative gradients. To address this, we propose Hybrid Policy Optimization (HPO), which backpropagates through the simulator wherever smoothness permits, using a mixed gradient estimator that combines pathwise and SF gradients while maintaining unbiasedness. We also show how problems with action discontinuities can be reformulated in hybrid form, further broadening its applicability. Empirically, HPO substantially outperforms PPO on inventory control and switched linear-quadratic regulator problems, with performance gaps increasing as the continuous action dimension grows. Finally, we characterize the structure of the mixed gradient, showing that its cross term -- which captures how continuous actions influence future discrete decisions -- becomes negligible near a discrete best response, thereby enabling approximate decentralized updates of the continuous and discrete components and reducing variance near optimality.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Machine Learning

2605.14297

Genre: Research Report (0.82)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)

Add feedback

From Data to Action: Accelerating Refinery Optimization with AI

Pfeifer, Dániel, Papp, Ábrahám, Bernáth, Tibor, Varga, Tamás Zoltán, Czifra, Márk, Szilágyi, Botond, Kovács, Edith Alice

arXiv.org Machine LearningMay-15-2026

Nowadays refinery optimization utilizes sheer amounts of data, which can be handled with modern Linear Programming (LP) software, but the interpreting and applying the results remains challenging. Large petrochemical companies use massive models, with hundreds of thousands of input matrix elements. The LP solution is mathematically correct, but simplifications are made in the model, and data supply errors may occur. Therefore, further insight is needed to trust the results. The LP solver does not have a memory, so additional understanding could be gained by analyzing historical data and comparing it to the current plan. As such, machine learning approaches were suggested to support decision making based on the LP solution. Among these, Anomaly Detection tools are proposed to be used in tandem with the LP output. A transformed version of the popular ECOD methodology is applied. New methods are proposed to handle high-dimensional data: choosing the most informative pairs. Then, this is used alongside two 2D Anomaly Detection algorithms, revealing several business opportunities and data supply errors in the MOL refinery scheduling and planning architecture.

arXiv.org Machine Learning

2605.15085

Country: Europe > Hungary (0.14)

Genre: Research Report (1.00)

Industry:

Materials > Chemicals > Commodity Chemicals > Petrochemicals (1.00)
Energy > Oil & Gas > Upstream (1.00)
Energy > Oil & Gas > Downstream (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Cuba plunged deeper into fuel crisis amid 22-hour blackouts

Al JazeeraMay-14-2026, 18:13:48 GMT

Cuba faced worsening nationwide disruption after officials say the country had completely run out of diesel and fuel oil, triggering severe blackouts and a partial collapse of the national power grid. Residents in Havana endured 22-hour electricity cuts amid a US blockade on the island.

artificial intelligence, live navigation menu news show, news section africa asia us, (6 more...)

Al Jazeera

Country:

North America > Cuba > La Habana Province > Havana (0.27)
Asia > Middle East > Israel (0.24)

Industry: Energy (0.59)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.55)

Add feedback

Japan runs out of robot wolves in fight against bears

The handmade, $4,000 Monster Wolf features solar panels, sensors, and intimidating audio. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. The robot scarecrows are used to ward off bears in rural areas. Breakthroughs, discoveries, and DIY tips sent six days a week. Japan's bear problem continues, and the country is running out of the robot wolves that help keep them at bay.

artificial intelligence, physics popular science video space, weirdest thing, (9 more...)

Popular Science

Country:

Asia > Japan (0.75)
North America > United States (0.49)

Genre: Research Report (0.36)

Industry:

Energy (0.51)
Health & Medicine > Therapeutic Area (0.34)

Technology: Information Technology > Artificial Intelligence > Robots (0.96)

Add feedback

'The world is sounding an alarm': Why big tech is the new colonist

Al JazeeraMay-14-2026, 15:08:38 GMT

Why is Israel still in southern Lebanon? A war to shape Lebanon's future'The world is sounding an alarm': Why big tech is the new colonist When investigations by Al Jazeera and other media outlets in 2024 revealed that Israeli-linked artificial intelligence (AI) systems such as Lavender and Gospel had helped generate thousands of military targets in Gaza, critics warned that warfare was entering a new era - one driven not only by soldiers and bombs, but by algorithms, data, and surveillance technology. Then, in September 2024, thousands of pagers and walkie-talkies used by members of Hezbollah exploded in coordinated attacks in Lebanon, widely attributed to Israeli intelligence operations that had turned ordinary communication devices into weapons. And, last year, reporting by Al Jazeera also raised concerns about the use of cloud and data infrastructure linked to major US technology companies in Israeli surveillance operations involving Palestinians. For a growing number of scholars, economists and political thinkers, such developments reflect more than just the changing nature of conflict.

artificial intelligence, live navigation menu news show, news section africa asia us, (7 more...)

Al Jazeera

Country:

Asia > Middle East > Republic of Türkiye (0.71)
Asia > Middle East > Lebanon (0.68)
Asia > Middle East > Palestine > Gaza Strip > Gaza Governorate > Gaza (0.25)

Industry:

Information Technology (0.92)
Energy (0.73)
Government > Military (0.49)
(2 more...)

Technology: Information Technology > Artificial Intelligence (0.71)

Add feedback

Plan Before You Trade: Inference-Time Optimization for RL Trading Agents

Go, Eun, Deb, Rohan, Banerjee, Arindam

arXiv.org Machine LearningMay-14-2026

Reinforcement learning agents for portfolio management are typically trained and deployed as static policies, with no mechanism for using price forecasts at inference time. We propose $\text{FPILOT}$ (**Fin**ancial **P**lugin **I**nference-time **L**earning for **O**ptimal **T**rading), a plugin inference-time optimization framework inspired by Model Predictive Control (MPC). Our key structural insight is that future prices mostly do not depend on one agent's portfolio allocation, so a suitable predictive model can produce a multi-step price trajectory without iterative action-conditioned rollouts as in typical reinforcement learning. At each decision step, we use the forecaster's predicted price trajectory to construct an allocation-based imagined return objective, and optimize the policy at inference-time before executing one step of the trade. Our framework is compatible with any pre-trained agent and adapts the policy to the forecaster's predictions without any retraining. Evaluated across five policy learning algorithms on the TradeMaster DJ30 benchmark, $\text{FPILOT}$ produces consistent improvements in total return and return-based risk-adjusted metrics (Sharpe, Sortino, Calmar), with stochastic policies benefiting more than deterministic ones. Further, using synthetic forecasts at calibrated quality levels, we show that gains consistently improve with forecaster quality, suggesting that our performance will improve based on advances in financial forecasting.

arXiv.org Machine Learning

2605.12653

Country: North America > United States > New York (0.28)

Genre: Research Report (0.64)

Industry:

Banking & Finance > Trading (1.00)
Energy > Oil & Gas (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)

Add feedback

ISOMORPH: A Supply Chain Digital Twin for Simulation, Dataset Generation, and Forecasting Benchmarks

Zhang, Zhizhen, Gu, Hyemin, Zhang, Benjamin J., Elenius, Daniel, Tyrrell, Michael, Bourdais, Theo J., Owhadi, Houman, Katsoulakis, Markos A., Sahai, Tuhin

arXiv.org Machine LearningMay-14-2026

Open time-series forecasting (TSF) benchmarks cover retail, energy, weather, and traffic, but supply-chain logistics remains underserved. We introduce ISOMORPH, the first public digital twin of a multi-echelon logistics network with fully interpretable, user-configurable parameters and modular topology, demand process, and control rules. The simulator advances a directed routing graph in discrete time: demand arrives at the destination, is served from stock or recorded as backlog, and triggers replenishment through the network. The state vector tracks per-node on-hand inventory with outstanding orders, in-transit shipments, and a smoothed demand estimate, so the dynamics close as a Markov chain on a tractable state space whose transition kernel acts linearly on the empirical distribution of the state. The released data reproduces the bullwhip effect at empirically consistent magnitudes, and three conservation laws encoded in the Markov chain serve as verification tools when users extend the simulator. We release datasets at two catalogue scales ($C=50$ and $C=200$) with six scenario sweeps producing 30 additional rollouts and 20 Latin-hypercube perturbations, exhibiting dynamics absent from fixed TSF benchmarks: variance amplification, cascading bottlenecks, regime shifts, and cross-channel coupling through shared macro shocks. Zero-shot evaluation of four foundation models (Chronos, Moirai, TimesFM, Lag-Llama) shows MASE values exceeding public GIFT-Eval references at low-to-moderate horizons, supporting incorporation into existing benchmarks. The same pairing produces forecast confidence bands via Latin-hypercube perturbation of demand-side knobs, forward UQ from parameter uncertainty unavailable on standard TSF datasets, demonstrating that foundation models can serve as fast surrogates for the digital twin's forward UQ. Code (MIT): https://github.com/tuhinsahai/ISOMORPH.

artificial intelligence, machine learning, simulator, (18 more...)

arXiv.org Machine Learning

2605.12768

Country: North America > United States (1.00)

Genre: Research Report (0.40)

Industry:

Banking & Finance > Trading (0.34)
Energy > Power Industry (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.54)

Add feedback

On Hallucinations in Inverse Problems: Fundamental Limits and Provable Assessment Methods

Iagaru, David, Gottschling, Nina M., Hansen, Anders C., Garnier, Josselin

arXiv.org Machine LearningMay-14-2026

While deep learning has revolutionised inverse problems, its safe deployment is hindered by three primary reliability concerns: hallucinations, instabilities, and performance volatility [48]. Hallucinations manifest as high-fidelity features that are factually false; instabilities reflect heightened sensitivity to measurement noise; and performance volatility refers to significant fluctuations in reconstruction quality across the data, yielding high-fidelity results for some samples while failing on seemingly similar images. In many applications, the risk of generating realistic but unfaithful content can impede the safe deployment of AI methods for inverse problems. The choice of "hallucinate" as the Cambridge Dictionary's word of the year in 2023 illustrates this open problem [53]. The problem of AI hallucinations persists, as the Financial Times [44] highlighted that, "AI hallucinations haunt users more than job losses." A first step toward training AI methods that do not suffer from hallucinations is the assessment and identification of hallucinated outputs. Consider the inverse problem of recovering xfrom noisy measurements y " Fpx,eq, x PM1 ĂX, e PEĂY, (1.1)

artificial intelligence, machine learning, reconstruction, (18 more...)

arXiv.org Machine Learning

2605.13146

Country:

North America > United States (1.00)
Europe (0.67)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Energy (1.00)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

What is Learnable in Valiant's Theory of the Learnable?

Hanneke, Steve, Mehrotra, Anay, Velegkas, Grigoris, Zampetakis, Manolis

arXiv.org Machine LearningMay-14-2026

Valiant's 1984 paper is widely credited with introducing the PAC learning model, but it, in fact, introduced a different model: unlike PAC learning, the learner receives only positives, may issue membership queries, and must output a hypothesis with no false positives. Prior work characterized variants, including the case without queries. We revisit Valiant's original model and ask: *Which classes are learnable in it?* For every finite domain, including Valiant's Boolean-hypercube setting, we show that a class is learnable if and only if every realizable positive sample can be certified by a poly-size adaptive query-compression scheme. This is a new variant of sample compression where the learner certifies samples via a short interaction with the membership oracle. Our characterization shows that learnability in Valiant's model is strictly sandwiched between learnability in the PAC model and the variant of Valiant's model without membership queries. This is one of the rare cases where introducing membership queries changes the set of learnable classes, and not just the sample or computational complexity. Next, we study the natural extension of the model to arbitrary domains. While we do not obtain an exact characterization, our techniques readily generalize and show that the same strict sandwiching persists. Finally, we show that $d$-dimensional halfspaces, which are not learnable without queries, are learnable with queries: we give a $\mathrm{poly}(d) \tilde{O}(1/ε)$ sample and $\mathrm{poly}(d) \mathrm{polylog}(1/ε)$ query algorithm, and prove that at least $Ω(d)$ samples or queries are necessary. To our knowledge, this is the first algorithm for halfspaces in Valiant's model. Together, these results uncover a surprisingly rich theory behind Valiant's original notion of learnability and introduce ideas that may be of independent interest in learning theory.

artificial intelligence, machine learning, valiant, (17 more...)

arXiv.org Machine Learning

2605.1384

Country: North America > United States (0.27)

Genre: Research Report (1.00)

Industry:

Education (0.45)
Energy (0.45)
Law (0.33)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)

Add feedback