How regularization affects the geometry of loss functions
Bottman, Nathaniel, Cooper, Y., Lerario, Antonio
–arXiv.org Artificial Intelligence
What neural networks learn depends fundamentally on the geometry of the underlying loss function. We study how different regularizers affect the geometry of this function. One of the most basic geometric properties of a smooth function is whether it is Morse or not. For nonlinear deep neural networks, the unregularized loss function $L$ is typically not Morse. We consider several different regularizers, including weight decay, and study for which regularizers the regularized function $L_\epsilon$ becomes Morse.
arXiv.org Artificial Intelligence
Jul-28-2023
- Country:
- Asia > Middle East
- Israel (0.04)
- North America > United States
- New York (0.04)
- Asia > Middle East
- Genre:
- Research Report (0.82)
- Technology: