Effective Bayesian Heteroscedastic Regression with Deep Neural Networks Alexander Immer

Neural Information Processing Systems 

Further, we emphasize the significance of principled regularization of the network parameters and prediction.