Inferring the Optimal Policy using Markov Chain Monte Carlo
