Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs

UMPA, ENS Lyon Paris, France
