Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs

Open in new window