Improved Sample Complexity for Incremental Autonomous Exploration in MDPs

Dec-24-2025, 05:52:58 GMT–Neural Information Processing Systems

We study the problem of exploring an unknown environment when no reward function is provided to the agent. Building on the incremental exploration setting introduced by Lim and Auer (2012), we define the objective of learning the set of $\epsilon$-optimal goal-conditioned policies attaining all states that are incrementally reachable within $L$ steps (in expectation) from a reference state $s_0$. In this paper, we introduce a novel model-based approach that interleaves discovering new states from $s_0$ and improving the accuracy of a model estimate that is used to compute goal-conditioned policies.

improved sample complexity, incremental autonomous exploration, name change, (7 more...)

Neural Information Processing Systems

Dec-24-2025, 05:52:58 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.39)

Technology:
- Information Technology > Artificial Intelligence (0.39)