Tighter Value Function Bounds for Bayesian Reinforcement Learning

Open in new window