Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima

Dec-31-2018–Neural Information Processing Systems

We propose stochastic optimization algorithms that can find local minima faster than existing algorithms for nonconvex optimization problems, by exploiting the third-order smoothness to escape non-degenerate saddle points more efficiently. More specifically, the proposed algorithm only needs $\tilde{O}(\epsilon^{-10/3})$ stochastic gradient evaluations to converge to an approximate local minimum $\mathbf{x}$, which satisfies $\|\nabla f(\mathbf{x})\|_2\leq\epsilon$ and $\lambda_{\min}(\nabla^2 f(\mathbf{x}))\geq -\sqrt{\epsilon}$ in unconstrained stochastic optimization, where $\tilde{O}(\cdot)$ hides logarithm polynomial terms and constants. This improves upon the $\tilde{O}(\epsilon^{-7/2})$ gradient complexity achieved by the state-of-the-art stochastic local minima finding algorithms by a factor of $\tilde{O}(\epsilon^{-1/6})$. Experiments on two nonconvex optimization problems demonstrate the effectiveness of our algorithm and corroborate our theory.

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Dec-31-2018

Conferences PDF

Add feedback

Country:
- Africa > Middle East
  - Tunisia > Ben Arous Governorate > Ben Arous (0.04)
- Asia > Middle East
  - Jordan (0.05)
- North America
  - Canada > Quebec
    - Montreal (0.04)
  - United States
    - California > Los Angeles County
      - Los Angeles (0.29)
    - Virginia > Albemarle County
      - Charlottesville (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning
    - Gradient Descent (0.39)
  - Representation & Reasoning > Optimization (1.00)

Duplicate Docs Excel Report

Title
Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima
Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima

Similar Docs Excel Report more

Title	Similarity	Source
None found