Goto

Collaborating Authors

 Reinforcement Learning






Learning to Influence Human Behavior with Offline Reinforcement Learning

Neural Information Processing Systems

When interacting with people, AI agents do not just influence the state of the world - they also influence the actions people take in response to the agent, and even their underlying intentions and strategies.




Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning

Neural Information Processing Systems

Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies. Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive policy class, significantly boosts the performance of offline RL. However, its reliance on iterative denoising sampling to generate actions slows down both training and inference.