On the Generalization of Stochastic Gradient Descent with Momentum

Open in new window