Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction

Open in new window