SGD for Structured Nonconvex Functions: Learning Rates, Minibatching and Interpolation
