Polyak's Heavy Ball Method Achieves Accelerated Local Rate of Convergence under Polyak-Lojasiewicz Inequality
Kassing, Sebastian, Weissmann, Simon
–arXiv.org Artificial Intelligence
In this work, we consider the convergence of Polyak's heavy ball method, both in continuous and discrete time, on a non-convex objective function. We recover the convergence rates derived in [Pol64] for strongly convex objective functions, assuming only validity of the Polyak-Lojasiewicz inequality. In continuous time our result holds for all initializations, whereas in the discrete time setting we conduct a local analysis around the global minima. Our results demonstrate that the heavy ball method does, in fact, accelerate on the class of objective functions satisfying the Polyak-Lojasiewicz inequality. This holds even in the discrete time setting, provided the method reaches a neighborhood of the global minima. Instead of the usually employed Lyapunov-type arguments, our approach leverages a new differential geometric perspective of the Polyak-Lojasiewicz inequality proposed in [RB24].
arXiv.org Artificial Intelligence
Oct-22-2024
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Germany > Saxony
- Dresden (0.04)
- Montenegro (0.04)
- Germany > Saxony
- North America > United States (0.04)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.74)
- Industry:
- Leisure & Entertainment > Sports > Tennis (0.86)
- Technology: