c82836ed448c41094025b4a872c5341e-Supplemental.pdf
–Neural Information Processing Systems
Recently there has been significant theoretical progress on understanding the convergence andgeneralization ofgradient-based methods onnonconvexlosses withoverparameterized models. Nevertheless, manyaspectsofoptimization and generalization and in particular the critical role of small random initialization are not fully understood.
Neural Information Processing Systems
Feb-11-2026, 03:23:29 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > Germany
- Bavaria > Upper Bavaria > Ingolstadt (0.04)
- North America > United States
- Maryland > Baltimore (0.04)
- New York > New York County
- New York City (0.14)
- Asia > Middle East
- Technology: