Progress in Nonsmooth Optimization part4(Machine Learning)

Mar-15-2023, 07:00:21 GMT–#artificialintelligence

Abstract:: We propose and analyze several stochastic gradient algorithms for finding stationary points or local minimum in nonconvex, possibly with nonsmooth regularizer, finite-sum and online optimization problems. First, we propose a simple proximal stochastic gradient algorithm based on variance reduction called ProxSVRG . We provide a clean and tight analysis of ProxSVRG, which shows that it outperforms the deterministic proximal gradient descent (ProxGD) for a wide range of minibatch sizes, hence solves an open problem proposed in Reddi et al. (2016b). Also, ProxSVRG uses much less proximal oracle calls than ProxSVRG (Reddi et al., 2016b) and extends to the online setting by avoiding full gradient computations. Then, we further propose an optimal algorithm, called SSRGD, based on SARAH (Nguyen et al., 2017) and show that SSRGD further improves the gradient complexity of ProxSVRG and achieves the optimal upper bound, matching the known lower bound of (Fang et al., 2018; Li et al., 2021).

algorithm, machine learning, nonsmooth optimization part4, (10 more...)

#artificialintelligence

Mar-15-2023, 07:00:21 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.83)
  - Machine Learning > Statistical Learning
    - Gradient Descent (0.80)