Sharp Analysis of Stochastic Optimization under Global Kurdyka-Łojasiewicz Inequality

Apr-26-2026, 12:58:19 GMT–Neural Information Processing Systems

We study the complexity of finding the global solution to stochastic nonconvex optimization when the objective function satisfies a global Kurdyka-Łojasiewicz (KŁ) inequality and the queries from stochastic gradient oracles satisfy a mild expected smoothness assumption. We first introduce a general framework to analyze Stochastic Gradient Descent (SGD) and its associated nonlinear dynamics under this setting. As a byproduct of our analysis, we obtain a sample complexity of $\mathcal{O}(\epsilon^{-(4-\alpha)/\alpha})$ for SGD when the objective satisfies the so-called $\alpha$-PŁ condition, where $\alpha$ is the degree of gradient domination. Furthermore, we show that a modified SGD with variance reduction and restarting (PAGER) achieves an improved sample complexity of $\mathcal{O}(\epsilon^{-2/\alpha})$ when the objective satisfies the average smoothness assumption. This leads to the first optimal algorithm for the important case of $\alpha = 1$, which appears in applications such as policy optimization in reinforcement learning.
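To make the two rates easier to compare, the short sketch below evaluates the exponent p in each O(ϵ^{-p}) bound quoted in the abstract for a few values of α. The function names `sgd_exponent` and `pager_exponent` are hypothetical and chosen only for this illustration. The check that α is positive is an assumption added to avoid division by zero, not a condition taken from the abstract. The sketch ignores constants and any logarithmic factors.

```python
from fractions import Fraction


def sgd_exponent(alpha):
    """Exponent p in the O(eps^-p) bound quoted for SGD: p = (4 - alpha) / alpha."""
    alpha = Fraction(alpha)
    if alpha <= 0:
        # Assumption for this sketch: alpha must be positive.
        raise ValueError("alpha must be positive")
    return (4 - alpha) / alpha


def pager_exponent(alpha):
    """Exponent p in the O(eps^-p) bound quoted for PAGER: p = 2 / alpha."""
    alpha = Fraction(alpha)
    if alpha <= 0:
        # Assumption for this sketch: alpha must be positive.
        raise ValueError("alpha must be positive")
    return 2 / alpha


if __name__ == "__main__":
    # Compare the two exponents at alpha = 1, 3/2, and 2.
    for alpha in (Fraction(1), Fraction(3, 2), Fraction(2)):
        print(f"alpha={alpha}: SGD eps^-{sgd_exponent(alpha)}, "
              f"PAGER eps^-{pager_exponent(alpha)}")
```

At α = 2 both exponents equal 1, so the two quoted bounds coincide. At α = 1 the SGD bound scales as ϵ^{-3}, while the PAGER bound scales as ϵ^{-2}.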

artificial intelligence, complexity, machine learning

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Mathematical & Statistical Methods (1.00)
  - Machine Learning > Statistical Learning
    - Gradient Descent (0.76)
