Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search

Open in new window