

4b121e627d3c5683f312ad168988f3f0-Supplemental-Conference.pdf

Neural Information Processing Systems

A.2 Main Proof Sketch. In this section we give a theoretical guarantee for the performance of our algorithm. Essentially, it measures the largest total difference in value estimates among all functions f in F_t for the fixed inputs x_{t,i}, where i is in [M]. Lemma 2. If (beta_t >= 0 | t in N) is a nondecreasing sequence and F_t := ... The main structure of this proof is similar to Proposition 3, Section C of the eluder-dimension paper, and we point out only the subtle details that make the difference. Apart from the notation of Section 3, we add further symbols for the regret analysis. Next, we show that f^h is a feasible solution to the optimization problem defining F_t.
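The quantity described above ("the largest total difference of value estimation among all the functions in F_t") matches the standard width of a confidence set from the eluder-dimension literature. A plausible reconstruction in that notation (an assumption based on the surrounding text, not a formula recovered verbatim from the source) is:

```latex
w_{\mathcal{F}_t}\bigl(x_{t,i}\bigr)
  \;=\; \sup_{f,\,\bar{f} \in \mathcal{F}_t}
        \bigl( f(x_{t,i}) - \bar{f}(x_{t,i}) \bigr),
  \qquad i \in [M],
\qquad
W_t \;=\; \sum_{i=1}^{M} w_{\mathcal{F}_t}\bigl(x_{t,i}\bigr).
```

Here w is the per-input width and W_t the total width summed over the M fixed inputs; regret bounds in this style then control how often W_t can be large.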






DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems

Neural Information Processing Systems

Recently, deep reinforcement learning (DRL) models have shown promising results in solving NP-hard Combinatorial Optimization (CO) problems. However, most DRL solvers can only scale to a few hundred nodes for combinatorial optimization problems on graphs, such as the Traveling Salesman Problem (TSP). This paper addresses the scalability challenge in large-scale combinatorial optimization by proposing a novel approach, namely DIMES. Unlike previous DRL methods, which suffer from costly autoregressive decoding or iterative refinement of discrete solutions, DIMES introduces a compact continuous space for parameterizing the underlying distribution of candidate solutions. Such a continuous space allows stable REINFORCE-based training and fine-tuning via massively parallel sampling. We further propose a meta-learning framework to enable effective initialization of model parameters in the fine-tuning stage. Extensive experiments show that DIMES outperforms recent DRL-based methods on large benchmark datasets for Traveling Salesman Problems and Maximal Independent Set problems.
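The core idea of parameterizing a distribution over candidate solutions with continuous values and training it by REINFORCE with parallel sampling can be illustrated with a toy sketch. This is not the DIMES architecture (which uses graph networks and meta-learning); the subset-selection problem, scores, and hyperparameters below are invented purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy CO problem: choose a subset of 5 items; the cost of a solution x is
# -(x . scores), so items with positive score should be included and items
# with negative score excluded. 'scores' is made up for this sketch.
scores = np.array([0.5, -0.8, 0.3, -0.2, 0.9])

# Continuous parameters: one logit per item, defining independent Bernoulli
# inclusion probabilities -- a compact continuous space over discrete solutions.
theta = np.zeros(scores.size)

def reinforce_grad(theta, n_samples=256):
    """Monte-Carlo REINFORCE gradient of the expected cost, with a mean baseline."""
    p = 1.0 / (1.0 + np.exp(-theta))                          # inclusion probs
    x = (rng.random((n_samples, p.size)) < p).astype(float)   # parallel samples
    cost = -(x * scores).sum(axis=1)                          # cost per sample
    grad_logp = x - p                 # d/dtheta of log Bernoulli likelihood
    baseline = cost.mean()            # variance reduction
    return ((cost - baseline)[:, None] * grad_logp).mean(axis=0)

for _ in range(200):
    theta -= 0.5 * reinforce_grad(theta)   # gradient descent on expected cost

p_final = 1.0 / (1.0 + np.exp(-theta))
print(p_final.round(2))   # positive-score items drift toward 1, negative toward 0
```

Because every sample's cost and log-likelihood gradient are computed with vectorized array operations, the number of parallel samples can be increased cheaply, which is the property the abstract highlights for fine-tuning.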




DIME: Diffusion-Based Maximum Entropy Reinforcement Learning

Celik, Onur, Li, Zechu, Blessing, Denis, Li, Ge, Palanicek, Daniel, Peters, Jan, Chalvatzaki, Georgia, Neumann, Gerhard

arXiv.org Artificial Intelligence

Maximum entropy reinforcement learning (MaxEnt-RL) has become the standard approach to RL due to its beneficial exploration properties. Traditionally, policies are parameterized using Gaussian distributions, which significantly limits their representational capacity. Diffusion-based policies offer a more expressive alternative, yet integrating them into MaxEnt-RL poses challenges, primarily due to the intractability of computing their marginal entropy. To overcome this, we propose Diffusion-Based Maximum Entropy RL (DIME). DIME leverages recent advances in approximate inference with diffusion models to derive a lower bound on the maximum entropy objective. Additionally, we propose a policy iteration scheme that provably converges to the optimal diffusion policy. Our method enables the use of expressive diffusion-based policies while retaining the principled exploration benefits of MaxEnt-RL, significantly outperforming other diffusion-based methods on challenging high-dimensional control benchmarks. It is also competitive with state-of-the-art non-diffusion-based RL methods while requiring fewer algorithmic design choices and smaller update-to-data ratios, reducing computational complexity.
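The maximum entropy objective that DIME lower-bounds can be made concrete in the simplest setting where it is tractable: a tabular MDP, where soft value iteration adds an entropy bonus to the Bellman backup and the optimal MaxEnt policy is a softmax over Q-values. The 2-state MDP below is invented for illustration; DIME's contribution is making this work when the policy is a diffusion model, which this sketch does not attempt:

```python
import numpy as np

# Invented 2-state, 2-action MDP for illustration only.
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])                    # R[s, a]: immediate reward
P = np.array([[[0.9, 0.1], [0.1, 0.9]],
              [[0.8, 0.2], [0.3, 0.7]]])      # P[s, a, s']: transition probs
alpha, gamma = 0.5, 0.9                       # entropy temperature, discount

V = np.zeros(2)
for _ in range(500):
    Q = R + gamma * (P @ V)                   # soft Bellman backup targets
    # Soft max over actions: V(s) = alpha * log sum_a exp(Q(s,a)/alpha),
    # i.e. max_a Q(s,a) plus an entropy bonus of at most alpha*log|A|.
    V = alpha * np.log(np.exp(Q / alpha).sum(axis=1))

pi = np.exp(Q / alpha)
pi /= pi.sum(axis=1, keepdims=True)           # optimal MaxEnt policy: softmax(Q/alpha)
print(V, pi)
```

As alpha shrinks toward 0 this recovers ordinary value iteration with a greedy policy; larger alpha keeps the policy stochastic, which is the exploration property the abstract refers to. For continuous actions and diffusion policies, the log-sum-exp and the policy entropy become intractable, motivating the lower bound proposed in the paper.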

