Efficient Diffusion Policies for Offline Reinforcement Learning
–Neural Information Processing Systems
Offline reinforcement learning (RL) aims to learn optimal policies from offline datasets, where the parameterization of policies is crucial but often overlooked.
Neural Information Processing Systems
Feb-17-2026, 07:38:46 GMT
- Technology: