Bayesian Nonparametric Multi-Optima Policy Search in Reinforcement Learning

Open in new window