Exploiting Sparsity in Pruned Neural Networks to Optimize Large Model Training

Open in new window