Review for NeurIPS paper: Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning

Neural Information Processing Systems 

In particular they convert epistemic uncertainty into "hallucinated controls" that are optimized, thereby leading to optimistic behavior.