Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD
