Understanding the Role of Momentum in Stochastic Gradient Methods
Igor Gitman, Hunter Lang, Pengchuan Zhang, Lin Xiao
–Neural Information Processing Systems
The use of momentum in stochastic gradient methods has become a widespread practice in machine learning. Different variants of momentum, including heavy-ball momentum, Nesterov's accelerated gradient (NAG), and quasi-hyperbolic momentum (QHM), have demonstrated success on various tasks.
Neural Information Processing Systems
Oct-2-2025, 17:02:50 GMT
- Country:
- Asia > Russia (0.04)
- Europe
- Austria (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Russia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > British Columbia
- United States
- Georgia > Fulton County
- Atlanta (0.04)
- New York (0.04)
- Washington > King County
- Redmond (0.04)
- Georgia > Fulton County
- Genre:
- Research Report > New Finding (0.47)