DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning