Tighter Value Function Bounds for Bayesian Reinforcement Learning