How SGD Selects the Global Minima in Over-parameterized Learning: A Dynamical Stability Perspective

Lei Wu, Chao Ma, Weinan E

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/