Neglected Hessian component explains mysteries in sharpness regularization

Oct-10-2025, 20:50:35 GMT–Neural Information Processing Systems

Sharpness-aware minimization (SAM) can improve generalization in deep learning. Seemingly similar methods like weight noise and gradient penalties often fail to provide such benefits. We investigate this inconsistency and reveal its connection to the structure of the Hessian of the loss.

derivative, information, penalty

Country:
- Europe > Austria (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.93)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
