Safe Exploration in Finite Markov Decision Processes with Gaussian Processes