Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent
Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal
Gradient descent and its variants are widely used in machine learning. However, oracle access to the gradient may not be available in many applications, limiting the direct use of gradient descent. This paper proposes a method of estimating the gradient in order to perform gradient descent that converges to a stationary point for general nonconvex optimization problems. Beyond first-order stationarity, second-order stationary properties are important in machine learning applications for achieving better performance. Gradient descent and its variants (e.g., Stochastic Gradient Descent) are widely used in machine learning due to their favorable computational properties, for example, in optimizing the weights of a deep neural network. Recently, second-order stationary guarantees have been studied using a perturbed version of gradient descent [2].
Oct-2-2019
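To make the idea concrete, below is a minimal Python sketch of the general approach described in the abstract: a two-point zeroth-order gradient estimate built from function evaluations only, plugged into plain gradient descent. The smoothing radius mu, the number of sampled directions, the step size, and the test function are illustrative assumptions, not the paper's exact algorithm or parameters.

import numpy as np

def estimated_gradient(f, x, mu=1e-4, num_dirs=None):
    """Two-point zeroth-order estimate of the gradient of f at x.

    Averages directional finite differences along random Gaussian
    directions; only function evaluations are used, no gradient oracle.
    """
    d = x.shape[0]
    num_dirs = num_dirs or d
    grad = np.zeros(d)
    for _ in range(num_dirs):
        u = np.random.randn(d)  # random search direction
        # finite-difference estimate of the directional derivative along u
        grad += (f(x + mu * u) - f(x - mu * u)) / (2 * mu) * u
    return grad / num_dirs

def zeroth_order_gd(f, x0, lr=0.1, steps=200):
    """Gradient descent driven entirely by the estimated gradient."""
    x = x0.copy()
    for _ in range(steps):
        x -= lr * estimated_gradient(f, x)
    return x

# Example: minimize a simple nonconvex test function with two minima
# per coordinate at +/- 1/sqrt(2) and a saddle at the origin.
f = lambda x: np.sum(x**4) - np.sum(x**2)
x_min = zeroth_order_gd(f, np.array([1.5, -2.0]))
print(x_min)  # each coordinate should land near +/- 0.707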