Preconditioned Sharpness-Aware Minimization: Unifying Analysis and a Novel Learning Algorithm
Zhang, Yilang, Li, Bingcong, Giannakis, Georgios B.
–arXiv.org Artificial Intelligence
Abstract--Targeting solutions over'flat' regions of the loss landscape, sharpness-aware minimization (SAM) has emerged Various approaches have also been proposed to further I. Unfortunately, Advances in deep neural network (DNN) architectures have a unifying framework is lacking to encompass existing SAM led to impressive success across various domains including variants, and inspire the principled design of novel approaches. Owing to the markedly Toward this goal, the present work relies on preconditioning high dimensionality, DNNs can memorize a large gamut of to unify SAM variants; hence, the term preconditioned training data [4]. As a result, small loss during training does (pre) SAM. Depending on where preconditioning is not guarantee generalization to unseen data.
arXiv.org Artificial Intelligence
Jan-11-2025
- Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
- Genre:
- Research Report (0.84)
- Technology: