Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling
