Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs

Open in new window