On the Provable Suboptimality of Momentum SGD in Nonstationary Stochastic Optimization

Open in new window