DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning

Open in new window