Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

Open in new window