Settling the Horizon-Dependence of Sample Complexity in Reinforcement Learning
