Local Smoothness in Variance Reduced Optimization Tong Zhang Dept. of Operations Research & Financial Engineering Dept. of Statistics Princeton University Rutgers University Princeton, NJ08544

Mar-12-2024, 21:43:28 GMT–Neural Information Processing Systems

We propose a family of non-uniform sampling strategies to provably speed up a class of stochastic optimization algorithms with linear convergence including Stochastic Variance Reduced Gradient (SVRG) and Stochastic Dual Coordinate Ascent (SDCA). For a large family of penalized empirical risk minimization problems, our methods exploit data dependent local smoothness of the loss functions near the optimum, while maintaining convergence guarantees. Our bounds are the first to quantify the advantage gained from local smoothness which are significant for some problems significantly better. Empirically, we provide thorough numerical results to back up our theory. Additionally we present algorithms exploiting local smoothness in more aggressive ways, which perform even better in practice.

algorithm, sdca, smoothness, (15 more...)

Neural Information Processing Systems

Mar-12-2024, 21:43:28 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > New Jersey
  - Middlesex County > Piscataway (0.04)
  - Mercer County > Princeton (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)