Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

Cutkosky, Ashok, Mehta, Harsh, Orabona, Francesco

Feb-11-2023–arXiv.org Artificial Intelligence

We present new algorithms for optimizing non-smooth, non-convex stochastic objectives based on a novel analysis technique. This improves the current best-known complexity for finding a $(\delta,\epsilon)$-stationary point from $O(\epsilon^{-4}\delta^{-1})$ stochastic gradient queries to $O(\epsilon^{-3}\delta^{-1})$, which we also show to be optimal. Our primary technique is a reduction from non-smooth non-convex optimization to online learning, after which our results follow from standard regret bounds in online learning. For deterministic and second-order smooth objectives, applying more advanced optimistic online learning techniques enables a new complexity of $O(\epsilon^{-1.5}\delta^{-0.5})$. Our techniques also recover all optimal or best-known results for finding $\epsilon$ stationary points of smooth or second-order smooth objectives in both stochastic and deterministic settings.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Artificial Intelligence

Feb-11-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Education > Educational Setting > Online (0.76)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Statistical Learning > Gradient Descent (0.36)
  - Enterprise Applications > Human Resources
    - Learning Management (0.76)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found