Average gradient outer product as a mechanism for deep neural collapse

Neural Information Processing Systems 

This assumes that DNNs are infinitely expressive and, thus, optimizes for the feature vectors in the last layer directly.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found