Deep linear networks for regression are implicitly regularized towards flat minima

Neural Information Processing Systems 

The sharpness of a neural network, defined as the largest eigenvalue of the Hessian of its loss, is a key quantity for understanding its optimization dynamics.
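To make the definition concrete, here is a minimal sketch that estimates the sharpness numerically. It is an illustration only, not the paper's setup or method. It assumes a toy scalar deep linear network (prediction = product of the weights times the input), a single training pair (x, y) = (1, 2), a finite-difference Hessian, and power iteration for the top eigenvalue. The names `loss`, `hessian`, and `sharpness` are hypothetical helpers introduced for this example.

```python
import math


def loss(params, x=1.0, y=2.0):
    """Squared loss of a toy scalar deep linear network (assumed setup)."""
    pred = x
    for w in params:
        pred *= w  # the network output is the product of all weights times x
    return 0.5 * (pred - y) ** 2


def hessian(f, params, eps=1e-4):
    """Central finite-difference approximation of the Hessian of f at params."""
    n = len(params)
    H = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                p = list(params)
                p[i] += di
                p[j] += dj
                return f(p)
            H[i][j] = (shifted(eps, eps) - shifted(eps, -eps)
                       - shifted(-eps, eps) + shifted(-eps, -eps)) / (4 * eps * eps)
    return H


def sharpness(f, params, iters=200):
    """Largest Hessian eigenvalue via power iteration.

    Power iteration converges to the eigenvalue of largest magnitude, which
    equals the largest eigenvalue when the Hessian is positive semidefinite,
    as it is at a minimum.
    """
    H = hessian(f, params)
    n = len(params)
    v = [1.0 / math.sqrt(n)] * n
    lam = 0.0
    for _ in range(iters):
        Hv = [sum(H[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(c * c for c in Hv))
        if norm == 0.0:
            return 0.0
        v = [c / norm for c in Hv]
        # Rayleigh quotient of the normalized iterate
        lam = sum(v[i] * sum(H[i][j] * v[j] for j in range(n)) for i in range(n))
    return lam


if __name__ == "__main__":
    balanced = [math.sqrt(2.0), math.sqrt(2.0)]  # zero-loss minimum with a = b
    unbalanced = [4.0, 0.5]                      # zero-loss minimum with a != b
    print("balanced sharpness:  ", sharpness(loss, balanced))
    print("unbalanced sharpness:", sharpness(loss, unbalanced))
```

In this two-weight toy, the Hessian at any zero-loss point (a, b) has eigenvalues 0 and a² + b², so the two minima above fit the data equally well but differ in sharpness: about 4 for the balanced solution and about 16.25 for the unbalanced one. A flat minimum, in the sense of the title, is one where this value is small.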
