Last-iterate convergence analysis of stochastic momentum methods for neural networks