Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning