Momentum-Based Variance Reduction in Non-Convex SGD

Open in new window