Reviews: Global Sparse Momentum SGD for Pruning Very Deep Neural Networks

Neural Information Processing Systems 

The paper proposes a method for pruning deep networks that selects which weights to keep based on the largest values of the gradient vector. The idea is new relative to previous attempts: although it is somewhat related to Fisher pruning, which also relies on gradient magnitudes, the method here is an SGD variant that prunes during training rather than a post-training evaluation method. The technique does not come with rigorous guarantees, but the reviewers agree that the experiments and accompanying studies are interesting enough to motivate future research on this method.
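
For concreteness, below is a minimal PyTorch sketch of the kind of update rule the summary describes, not the paper's exact algorithm: all weights are ranked globally by gradient magnitude (the saliency named above; the paper's actual criterion may differ), and only the top fraction drives the momentum update while the rest coast. The function name, the `keep_ratio` parameter, and the hyperparameter values are illustrative assumptions.

    import torch

    def global_sparse_momentum_step(params, momenta, lr=0.01, beta=0.9, keep_ratio=0.1):
        """One hand-rolled SGD-with-momentum step in which only the globally
        top-k parameters (ranked here by gradient magnitude, as a stand-in
        for the paper's saliency criterion) receive an active gradient
        update; the remaining parameters move on accumulated momentum only.
        """
        with torch.no_grad():
            # Flatten all gradients so the ranking is global across layers,
            # not per-layer.
            flat = torch.cat([p.grad.abs().flatten() for p in params])
            k = max(1, int(keep_ratio * flat.numel()))
            # Smallest magnitude among the top-k entries serves as the cutoff
            # (ties at the threshold may admit slightly more than k entries).
            threshold = torch.topk(flat, k).values.min()

            for p, m in zip(params, momenta):
                mask = (p.grad.abs() >= threshold).float()
                # Momentum accumulates only the masked (salient) gradients.
                m.mul_(beta).add_(p.grad * mask)
                p.sub_(lr * m)

    # Illustrative usage on a toy model:
    model = torch.nn.Linear(20, 5)
    momenta = [torch.zeros_like(p) for p in model.parameters()]
    loss = model(torch.randn(8, 20)).pow(2).mean()
    loss.backward()
    global_sparse_momentum_step(list(model.parameters()), momenta)

Over many such steps, parameters that rarely clear the threshold stop receiving gradient signal and drift toward values that can be pruned, which is what distinguishes this style of training-time sparsification from a post-training evaluation pass.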