HIQL: Offline Goal-Conditioned RL with Latent States as Actions

Dec-25-2025, 22:57:10 GMT–Neural Information Processing Systems

Unsupervised pre-training has recently become the bedrock for computer vision and natural language processing. In reinforcement learning (RL), goal-conditioned RL can potentially provide an analogous self-supervised approach for making use of large quantities of unlabeled (reward-free) data. However, building effective algorithms for goal-conditioned RL that can learn directly from diverse offline data is challenging, because it is hard to accurately estimate the exact value function for faraway goals. Nonetheless, goal-reaching problems exhibit structure, such that reaching distant goals entails first passing through closer subgoals. This structure can be very useful, as assessing the quality of actions for nearby goals is typically easier than for more distant goals.

artificial intelligence, machine learning, natural language, (8 more...)

Neural Information Processing Systems

Dec-25-2025, 22:57:10 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.76)
  - Natural Language (0.96)