Sample Complexity of Goal-Conditioned Hierarchical Reinforcement Learning

Jan-19-2025, 21:36:16 GMT–Neural Information Processing Systems

Hierarchical Reinforcement Learning (HRL) algorithms can perform planning at multiple levels of abstraction. Empirical results have shown that state or temporal abstractions can significantly improve the sample efficiency of algorithms. Yet we still lack a complete understanding of the basis of those efficiency gains, as well as theoretically grounded design rules. In this paper, we derive a lower bound on the sample complexity for a class of goal-conditioned HRL algorithms. The proposed lower bound allows us to quantify the benefits of hierarchical decomposition and leads to the design of a simple Q-learning-type algorithm that leverages hierarchical decompositions.

algorithm, goal-conditioned hierarchical reinforcement learning, hierarchical reinforcement learning, (2 more...)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)