Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift