L*-Based Learning of Markov Decision Processes (Extended Version)