Optimistic PAC Reinforcement Learning: the Instance-Dependent View

Open in new window