Acceleration of stochastic gradient descent with momentum by averaging: finite-sample rates and asymptotic normality

Open in new window