Regret Bounds for Reinforcement Learning with Policy Advice

Open in new window