On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport

Lénaïc Chizat, Francis Bach

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/