Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
–Neural Information Processing Systems
Exploring unknown environments efficiently is a fundamental challenge in unsupervised goal-conditioned reinforcement learning. While selecting exploratory goals at the frontier of previously explored states is an effective strategy, the policy during training may still have limited capability of reaching rare goals on the frontier, resulting in reduced exploratory behavior. We propose "Cluster Edge Exploration" (CE
Neural Information Processing Systems
May-29-2025, 23:32:46 GMT