A Unified Analysis of Stochastic Momentum Methods for Deep Learning