Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization

Neural Information Processing Systems 

With high-dimensional state spaces, visual reinforcement learning (RL) faces significant challenges in exploitation and exploration, resulting in low sample efficiency and training stability.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found