Goto

Collaborating Authors

 Reinforcement Learning




Unsupervised Behavior Extraction via Random Intent Priors

Neural Information Processing Systems

Reward-free data is abundant and contains rich prior knowledge of human behaviors, but it is not well exploited by offline reinforcement learning (RL) algorithms. In this paper, we propose UBER, an unsupervised approach to extract useful behaviors from offline reward-free datasets via diversified rewards.




Generalized Weighted Path Consistency for Mastering Atari Games

Neural Information Processing Systems

Reinforcement learning with the help of neural-guided search consumes huge computational resources to achieve remarkable performance.