From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
