The asymptotic spectrum of the Hessian of DNN throughout training
Jacot, Arthur, Gabriel, Franck, Hongler, Clément
The dynamics of DNNs during gradient descent is described by the so-called Neural Tangent Kernel (NTK). In this article, we show that the NTK allows one to gain precise insight into the Hessian of the cost of DNNs: we obtain a full characterization of the asymptotics of the spectrum of the Hessian, at initialization and during training.
Oct-1-2019
- Country:
- Africa > Middle East
- Tunisia > Ben Arous Governorate > Ben Arous (0.04)
- North America > United States
- California > Los Angeles County
- Long Beach (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New Jersey > Hudson County
- Secaucus (0.04)
- New York (0.04)
- California > Los Angeles County
- Africa > Middle East
- Genre:
- Research Report (0.63)
- Technology: