Average gradient outer product as a mechanism for deep neural collapse

Daniel Beaglehole (1), Peter Súkeník (2), Marco Mondelli (2), Mikhail Belkin (1)

Neural Information Processing Systems 

The unconstrained features model assumes that DNNs are infinitely expressive and thus optimizes the last-layer feature vectors directly.
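To make the idea of optimizing last-layer features directly concrete, the following is a minimal sketch of this kind of setup, often called an unconstrained features model. It is not the authors' code. The squared loss, the equal weight decay on both factors, the problem sizes, and the names `train_ufm` and `within_between_ratio` are all assumptions chosen for illustration. The features `H` are free variables updated by gradient descent alongside a linear classifier `W`, with no network producing them.

```python
import numpy as np


def train_ufm(num_classes=3, per_class=5, dim=4, lam=0.1, lr=0.05,
              steps=5000, seed=0):
    """Gradient descent on free features H and a linear classifier W.

    Illustrative objective (an assumption, not taken from the source):
        0.5 * ||W H - Y||_F^2 + 0.5 * lam * (||W||_F^2 + ||H||_F^2)
    """
    rng = np.random.default_rng(seed)
    n = num_classes * per_class
    labels = np.repeat(np.arange(num_classes), per_class)
    Y = np.eye(num_classes)[:, labels]          # one-hot targets, shape (K, N)

    # Small random initialization of both factors.
    W = 0.1 * rng.standard_normal((num_classes, dim))
    H = 0.1 * rng.standard_normal((dim, n))     # one free feature per sample

    for _ in range(steps):
        residual = W @ H - Y
        grad_W = residual @ H.T + lam * W
        grad_H = W.T @ residual + lam * H
        W -= lr * grad_W
        H -= lr * grad_H
    return W, H, labels


def within_between_ratio(H, labels):
    """Within-class scatter divided by between-class scatter of the features."""
    global_mean = H.mean(axis=1, keepdims=True)
    within, between = 0.0, 0.0
    for c in np.unique(labels):
        Hc = H[:, labels == c]
        mu = Hc.mean(axis=1, keepdims=True)
        within += ((Hc - mu) ** 2).sum()
        between += Hc.shape[1] * ((mu - global_mean) ** 2).sum()
    return within / between


if __name__ == "__main__":
    W, H, labels = train_ufm()
    print("within/between scatter:", within_between_ratio(H, labels))
```

In this toy setting the within-class scatter of `H` shrinks toward zero relative to the between-class scatter, which is the feature collapse that such models are used to study. Because the features are free variables, the sketch never looks at input data, which is the sense in which this kind of model is data-agnostic.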
