Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes