No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes

Open in new window