Gradient Descent on Two-layer Nets: Margin Maximization and Simplicity Bias

Neural Information Processing Systems 

On the pessimistic side, the paper suggests that such results are fragile.
