On Optimism in Model-Based Reinforcement Learning