Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models

Open in new window