Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes

Open in new window