Non-Convex Stochastic Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics
Hu, Yuanhan, Wang, Xiaoyu, Gao, Xuefeng, Gurbuzbalaban, Mert, Zhu, Lingjiong
Stochastic gradient Langevin dynamics (SGLD) is a poweful algorithm for optimizing a non-convex objective, where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global minimum. SGLD is based on the overdamped Langevin diffusion which is reversible in time. By adding an anti-symmetric matrix to the drift term of the overdamped Langevin diffusion, one gets a non-reversible diffusion that converges to the same stationary distribution with a faster convergence rate. In this paper, we study the non-reversible stochastic gradient Langevin dynamics (NSGLD) which is based on discretization of the non-reversible Langevin diffusion. We provide finite time performance bounds for the global convergence of NSGLD for solving stochastic non-convex optimization problems. Our results lead to non-asymptotic guarantees for both population and empirical risk minimization problems. Numerical experiments for a simple polynomial function optimization, Bayesian independent component analysis and neural network models show that NSGLD can outperform SGLD with proper choices of the anti-symmetric matrix.
Apr-6-2020
- Country:
- North America > United States
- New Jersey > Middlesex County
- Piscataway (0.04)
- Florida > Leon County
- Tallahassee (0.04)
- New Jersey > Middlesex County
- Europe > France
- Île-de-France > Paris
- Paris (0.04)
- Occitanie > Haute-Garonne
- Toulouse (0.04)
- Île-de-France > Paris
- Asia
- China > Hong Kong (0.04)
- Afghanistan > Parwan Province
- Charikar (0.04)
- North America > United States
- Genre:
- Research Report (0.84)
- Industry:
- Health & Medicine (0.68)
- Technology: