Normalization Layers Are All That Sharpness-Aware Minimization Needs Maximilian Müller University of Tübingen and Tübingen AI Center
–Neural Information Processing Systems
In this work we show that perturbing only the affine normalization parameters (typically comprising 0.1% of the total parameters) in the adversarial step of SAM can outperform perturbing all of the parameters.
Neural Information Processing Systems
Nov-19-2025, 21:56:09 GMT
- Country:
- Europe > Germany
- Baden-Württemberg > Tübingen Region > Tübingen (0.86)
- North America > Canada
- Europe > Germany
- Genre:
- Research Report > New Finding (0.67)
- Technology: