Goto

Collaborating Authors

 Optimization





NeverGoFullBatch (inStochasticConvexOptimization)

Neural Information Processing Systems

We study the generalization performance of full-batch optimization algorithms for stochastic convex optimization: these are first-order methods that only access the exact gradient of the empirical risk (rather than gradients with respect to individual data points), that include a wide range of algorithms such as gradient descent, mirror descent, and their regularized and/or accelerated variants.





aa84ec1ac3f5fdcf77bce2c22705ab77-Paper-Conference.pdf

Neural Information Processing Systems

Typical examples are hyperparameters selection [5, 38, 17, 6], data augmentation [11, 42], implicit deep learning [3] or neural architecture search [33]. Figure 1: Convergence curves of the two proposed methods on a toy problem.