Reviews: A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization

Oct-8-2024, 08:12:46 GMT–Neural Information Processing Systems

This paper studies the optimization problem min_x f(x) + h(x), where f has a finite-sum structure (a sum of n component functions) whose components are smooth but possibly nonconvex, and h is convex but possibly nonsmooth. In other words, this is a nonconvex finite-sum problem with a convex regularizer, and h is handled through a proximal step. The authors propose a small modification to ProxSVRG (called ProxSVRG+) and prove that it has surprisingly interesting consequences. The modification replaces the full gradient computation in the outer loop of ProxSVRG with a subsampled (minibatch) approximation of batch size B.
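To make the described modification concrete, here is a minimal Python sketch of a proximal variance-reduced loop in which the outer-loop gradient is estimated on a batch of size B rather than over all n components. It is an illustration under stated assumptions, not the authors' implementation: the choice h(x) = lam * ||x||_1 (so the prox is soft-thresholding), the robust loss used for the components, the inner minibatch size b, the epoch length m, the step size eta, and all function names are hypothetical and chosen only for this example.

```python
import numpy as np

def soft_threshold(v, tau):
    """Prox of tau * ||.||_1 (an assumed choice of the convex regularizer h)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def prox_svrg_plus_sketch(grad_i, n, x0, eta, lam, B, b, m, epochs, seed=0):
    """Sketch of a ProxSVRG-style loop with a subsampled outer gradient.

    grad_i(x, i) returns the gradient of the i-th smooth component at x.
    Setting B = n makes the outer step a full gradient, as in plain ProxSVRG.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(epochs):
        snapshot = x.copy()
        # Outer loop: batch of size B instead of the full gradient over n terms.
        batch = rng.choice(n, size=min(B, n), replace=False)
        g_snap = np.mean([grad_i(snapshot, i) for i in batch], axis=0)
        for _ in range(m):
            # Inner loop: variance-reduced minibatch gradient estimate.
            idx = rng.choice(n, size=b, replace=True)
            corr = np.mean([grad_i(x, i) - grad_i(snapshot, i) for i in idx], axis=0)
            v = corr + g_snap
            # Proximal step handles the nonsmooth term h.
            x = soft_threshold(x - eta * v, eta * lam)
    return x

# Toy problem (assumed): smooth nonconvex components f_i(x) = log(1 + r_i^2 / 2),
# with residual r_i = a_i . x - y_i, plus an L1 regularizer.
rng = np.random.default_rng(1)
n, d, lam = 200, 5, 0.01
A = rng.standard_normal((n, d))
x_true = np.array([1.0, -2.0, 0.0, 0.0, 1.5])
y = A @ x_true + 0.1 * rng.standard_normal(n)

def grad_i(x, i):
    r = A[i] @ x - y[i]
    return (r / (1.0 + 0.5 * r * r)) * A[i]

def objective(x):
    r = A @ x - y
    return np.mean(np.log1p(0.5 * r * r)) + lam * np.sum(np.abs(x))

x0 = np.zeros(d)
x_hat = prox_svrg_plus_sketch(grad_i, n, x0, eta=0.05, lam=lam,
                              B=50, b=10, m=20, epochs=30)
F0, F1 = objective(x0), objective(x_hat)
print(f"objective at start: {F0:.4f}, after sketch: {F1:.4f}")
```

The only structural difference from a full-gradient outer loop is the line that draws `batch`; everything else is a standard proximal variance-reduced update, which is why the review describes the change as small.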

artificial intelligence, machine learning, proxsvrg


Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Mathematical & Statistical Methods (0.42)
  - Machine Learning > Statistical Learning
    - Gradient Descent (0.42)