Active Exploration in Markov Decision Processes