On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

Jinglin Chen

Neural Information Processing Systems 

Our analyses indicate that the explorability or reachability assumptions, previously made for the latter two settings, are not statistically necessary for reward-free exploration.