Optimistic planning in Markov decision processes using a generative model

Open in new window