Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization