Random Sharpness-Aware Minimization

Neural Information Processing Systems 

Currently, Sharpness-A ware Minimization (SAM) is proposed to seek the parameters that lie in a flat region to improve the generalization when training neural networks.