Gradient Descent Maximizes the Margin of Homogeneous Neural Networks

Open in new window