Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations
