Regret Bounds for Learning State Representations in Reinforcement Learning

Open in new window