Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
