AITopics | Energy

Dr Jekyll and Mr Hyde: The Strange Case of Off-Policy Policy Updates
Romain Laroche (Microsoft Research, Montréal, Canada), Rémi Tachet des Combes (Microsoft Research, Montréal, Canada)

Neural Information Processing Systems | Aug-17-2025, 10:19:26 GMT

The policy gradient theorem states that the policy should only be updated in states that are visited by the current policy, which leads to insufficient planning in the off-policy states, and thus to convergence to suboptimal policies. We tackle this planning issue by extending the policy gradient theory to policy updates with respect to any state density. Under these generalized policy updates, we show convergence to optimality under a necessary and sufficient condition on the updates' state densities, and thereby solve the aforementioned planning issue. We also prove asymptotic convergence rates that significantly improve those in the policy gradient literature. To implement the principles prescribed by our theory, we propose an agent, Dr Jekyll & Mr Hyde (J&H), with a double personality: Dr Jekyll purely exploits while Mr Hyde purely explores. J&H's independent policies make it possible to record two separate replay buffers, one on-policy (Dr Jekyll's) and one off-policy (Mr Hyde's), and therefore to update J&H's models with a mixture of on-policy and off-policy updates. More than an algorithm, J&H defines principles for actor-critic algorithms to satisfy the requirements we identify in our analysis. We test extensively on finite MDPs, where J&H demonstrates a superior ability to recover from converging to a suboptimal policy without impairing its speed of convergence. We also implement a deep version of the algorithm and test it on a simple problem where it shows promising results.
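The two-buffer idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `ReplayBuffer` class, the `mixed_batch` helper, and the `off_fraction` parameter are hypothetical names introduced here, and the abstract does not say how the on-policy and off-policy updates are actually weighted.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-capacity buffer of transitions (hypothetical helper)."""

    def __init__(self, capacity):
        self.data = deque(maxlen=capacity)

    def add(self, transition):
        self.data.append(transition)

    def sample(self, n, rng):
        # Sample without replacement, capped at the buffer size.
        n = min(n, len(self.data))
        return rng.sample(list(self.data), n)

    def __len__(self):
        return len(self.data)


def mixed_batch(on_policy, off_policy, batch_size, off_fraction, rng):
    """Draw one batch mixing on-policy and off-policy transitions.

    off_fraction is an assumed mixing knob, not a quantity from the paper.
    """
    n_off = min(round(batch_size * off_fraction), len(off_policy))
    n_on = min(batch_size - n_off, len(on_policy))
    return on_policy.sample(n_on, rng) + off_policy.sample(n_off, rng)


# Usage: one buffer per personality, then a mixed batch for the update step.
jekyll_buffer = ReplayBuffer(capacity=100)  # filled while exploiting
hyde_buffer = ReplayBuffer(capacity=100)    # filled while exploring
for step in range(10):
    jekyll_buffer.add(("jekyll", step))
    hyde_buffer.add(("hyde", step))

batch = mixed_batch(jekyll_buffer, hyde_buffer, batch_size=8,
                    off_fraction=0.25, rng=random.Random(0))
```

Keeping the buffers separate means the share of off-policy data in each update is an explicit choice rather than a side effect of how much exploration happened to be recorded.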

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Country: North America > Canada > Quebec > Montreal (0.76)

Genre: Research Report > New Finding (0.46)

Industry: Energy > Oil & Gas > Upstream (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
(2 more...)

34b3a40ec9752c1ae48fe85fef8fe8dc-Paper-Conference.pdf

Neural Information Processing Systems | Aug-17-2025, 09:24:28 GMT

artificial intelligence, large language model, natural language, (17 more...)

Country:

North America > United States > Oregon (0.14)
North America > Canada > Alberta (0.14)
Europe > Russia (0.14)
(2 more...)

Genre:

Research Report > Experimental Study (1.00)
Workflow (0.66)

Industry:

Information Technology (0.93)
Media (0.68)
Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (1.00)

NSNet: A General Neural Probabilistic Framework for Satisfiability Problems
Zhaoyu Li

Neural Information Processing Systems | Aug-17-2025, 09:02:18 GMT

NSNet can be flexibly configured to solve both SAT and #SAT problems by applying different learning objectives.

artificial intelligence, assignment, machine learning, (16 more...)

Country: North America > Canada (0.28)

Industry: Energy > Oil & Gas (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Multiwavelet-based Operator Learning for Differential Equations

Neural Information Processing Systems | Aug-17-2025, 09:02:10 GMT

The projected kernel is trained at multiple scales obtained by repeatedly applying the multiwavelet transform. This allows the model to learn complex dependencies at various scales and results in a resolution-independent scheme.

artificial intelligence, machine learning, operator, (20 more...)

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Industry:

Government > Regional Government > North America Government > United States Government (0.67)
Energy > Oil & Gas > Upstream (0.67)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Mathematics of Computing (0.92)

a2affd71d15e8fedffe18d0219f4837a-Paper-Conference.pdf

Neural Information Processing Systems | Aug-17-2025, 08:25:32 GMT

artificial intelligence, machine learning, natural language, (17 more...)

Country:

North America > United States (0.14)
Asia > China > Hong Kong (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Energy > Renewable (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Constrained Sampling with Primal-Dual Langevin Monte Carlo

Neural Information Processing Systems | Aug-17-2025, 08:08:51 GMT

To do so, we bring classical optimization arguments for saddle-point algorithms to the geometry of Wasserstein space.

artificial intelligence, bayesian inference, machine learning, (16 more...)

Country:

North America > United States (0.27)
Europe > Germany (0.14)
Europe > Switzerland (0.14)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Banking & Finance (0.67)
Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.67)
Information Technology > Data Science (0.67)
(2 more...)

[Figure labels: CNN, Training data, Test image, Denoised image]

Neural Information Processing Systems | Aug-17-2025, 07:58:23 GMT

Here we propose "GainTuning", a

artificial intelligence, deep learning, machine learning, (18 more...)

Country:

North America > United States > Arizona (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
Europe > France (0.04)

Genre: Research Report (1.00)

Industry:

Energy (0.68)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

LAVIB: A Large-scale Video Interpolation Benchmark

Neural Information Processing Systems | Aug-17-2025, 07:35:24 GMT

LAVIB comprises a large collection of high-resolution videos sourced from the web through an automated pipeline with minimal requirements for human verification. Metrics are computed for each video's motion magnitudes, luminance conditions, frame

artificial intelligence, machine learning, video, (19 more...)

Industry: Energy > Oil & Gas (0.46)

Technology: