Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron