A Large-Scale Study of Probabilistic Calibration in Neural Network Regression