Learning ReLU Networks on Linearly Separable Data: Algorithm, Optimality, and Generalization
