On the Role of Optimization in Double Descent: A Least Squares Study

Neural Information Processing Systems 

One of their most intriguing features is their ability to perform better with scale. Indeed, some of the most impressive results [see e.g.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found