Identifying and attacking the saddle point problem in high-dimensional non-convex optimization
Yann N. Dauphin, Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Surya Ganguli, Yoshua Bengio
–Neural Information Processing Systems
We apply this algorithm to deep or recurrent neural network training, and provide numerical evidence for its superior optimization performance.
Aug-11-2025, 13:27:45 GMT