A Stochastic Variance Reduced Nesterov's Accelerated Quasi-Newton Method
Yasuda, Sota, Mahboubi, Shahrzad, Indrapriyadarsini, S., Ninomiya, Hiroshi, Asai, Hideki
--Recently algorithms incorporating second order curvature information have become popular in training neural networks. The Nesterov's Accelerated Quasi-Newton (NAQ) method has shown to effectively accelerate the BFGS quasi-Newton method by incorporating the momentum term and Nesterov's accelerated gradient vector . A stochastic version of NAQ method was proposed for training of large-scale problems. This paper proposes a stochastic variance reduced Nesterov's Accelerated Quasi-Newton method in full (SVR-NAQ) and limited (SVR-LNAQ) memory forms. The performance of the proposed method is evaluated in T ensorflow on four benchmark problems - two regression and two classification problems respectively. The results show improved performance compared to conventional methods.
Oct-17-2019
- Country:
- North America > Canada
- Asia > Japan
- Honshū > Chūbu > Shizuoka Prefecture > Shizuoka (0.04)
- Genre:
- Research Report (0.70)