On the Periodic Behavior of Neural Network Training with Batch Normalization and Weight Decay