Optimal Training of Mean Variance Estimation Neural Networks