Stochastic optimization and sparse statistical recovery: Optimal algorithms for high dimensions
–Neural Information Processing Systems
We develop and analyze stochastic optimization algorithms for problems in which the expected loss is strongly convex, and the optimum is (approximately) sparse. Previous approaches are able to exploit only one of these two structures, yielding a \order(\pdim/T) convergence rate for strongly convex objectives in \pdim dimensions and \order(\sqrt{\spindex( \log\pdim)/T}) convergence rate when the optimum is \spindex -sparse. Our algorithm is based on successively solving a series of \ell_1 -regularized optimization problems using Nesterov's dual averaging algorithm. We establish that the error of our solution after T iterations is at most \order(\spindex(\log\pdim)/T), with natural extensions to approximate sparsity. Our results apply to locally Lipschitz losses including the logistic, exponential, hinge and least-squares losses.
Neural Information Processing Systems
Feb-16-2024, 06:56:05 GMT
- Technology: