Regret Bounds for Kernel-Based Reinforcement Learning
