Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning

Feb-14-2026, 03:44:44 GMT–Neural Information Processing Systems

Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies. Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive policy class, significantly boosts the performance of offline RL. However, its reliance on iterative denoising sampling to generate actions slows down both training and inference.
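To illustrate why iterative denoising is costly, here is a minimal sketch assuming a standard DDPM-style reverse process with a linear noise schedule. It is not the sampler from the paper, which this page does not describe. `ToyNoisePredictor`, `make_schedule`, and `sample_action` are hypothetical names, and the toy predictor stands in for a learned network (it also assumes the state and action have the same dimension). The point is only that producing one action takes one network evaluation per denoising step.

```python
import math
import random


def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear beta schedule (an assumption; other schedules are common)."""
    betas = [
        beta_start + (beta_end - beta_start) * t / max(num_steps - 1, 1)
        for t in range(num_steps)
    ]
    alphas = [1.0 - b for b in betas]
    alpha_bars, prod = [], 1.0
    for a in alphas:
        prod *= a
        alpha_bars.append(prod)
    return betas, alphas, alpha_bars


class ToyNoisePredictor:
    """Hypothetical stand-in for a learned noise network eps(a_t, t, s)."""

    def __init__(self):
        self.calls = 0  # counts network evaluations

    def __call__(self, action, t, state):
        self.calls += 1
        # Toy rule: treat the offset from the state as the predicted noise.
        return [a - s for a, s in zip(action, state)]


def sample_action(predictor, state, num_steps, rng):
    """Generate one action by running the full reverse (denoising) chain."""
    betas, alphas, alpha_bars = make_schedule(num_steps)
    action = [rng.gauss(0.0, 1.0) for _ in state]  # start from pure noise
    for t in reversed(range(num_steps)):
        eps = predictor(action, t, state)  # one network call per step
        coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
        mean = [(a - coef * e) / math.sqrt(alphas[t]) for a, e in zip(action, eps)]
        sigma = math.sqrt(betas[t]) if t > 0 else 0.0  # no noise on the last step
        action = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
    return action


predictor = ToyNoisePredictor()
action = sample_action(predictor, [0.5, -0.2], num_steps=50, rng=random.Random(0))
print(predictor.calls)  # one call per denoising step
```

With 50 denoising steps, a single action costs 50 forward passes of the noise network. In a Q-learning loop that samples actions at every update and again at every environment step during evaluation, this cost applies to both training and inference.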

diffusion model, machine learning, reinforcement learning

Country:
- North America > United States > Texas > Travis County > Austin (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
