The Power of Interpolation: Understanding the Effectiveness of SGD in Modern Over-parametrized Learning