Efficient Diffusion Policies For Offline Reinforcement Learning

Dec-26-2025, 21:05:25 GMT–Neural Information Processing Systems

Offline reinforcement learning (RL) aims to learn optimal policies from offline datasets, where the parameterization of policies is crucial but often overlooked. Recently, Diffsuion-QL significantly boosts the performance of offline RL by representing a policy with a diffusion model, whose success relies on a parametrized Markov Chain with hundreds of steps for sampling. However, Diffusion-QL suffers from two critical limitations.

efficient diffusion policy, name change, offline reinforcement learning, (4 more...)

Neural Information Processing Systems

Dec-26-2025, 21:05:25 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.46)
  - Learning Graphical Models (0.43)