Nonparametric Model-Based Reinforcement Learning