Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes

Open in new window