Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization

Neural Information Processing Systems 

With high-dimensional state spaces, visual reinforcement learning (RL) faces significant challenges in exploitation and exploration, resulting in low sample efficiency and training stability.