Average gradient outer product as a mechanism for deep neural collapse

Open in new window