Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need

Open in new window