Horizon-Free Regret for Linear Markov Decision Processes

Open in new window