Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization