Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Dec-24-2025, 18:30:45 GMT–Neural Information Processing Systems

Unstructured pruning reduces the memory footprint in deep neural networks (DNNs). Recently, researchers proposed different types of structural pruning intending to reduce also the computation complexity. In this work, we first suggest a new measure called mask-diversity which correlates with the expected accuracy of the different types of structural pruning. We focus on the recently suggested N:M fine-grained block sparsity mask, in which for each block of M weights, we have at least N zeros. While N:M fine-grained block sparsity allows acceleration in actual modern hardware, it can be used only to accelerate the inference phase.

accelerated sparse neural training, name change, provable and efficient method, (7 more...)

Neural Information Processing Systems

Dec-24-2025, 18:30:45 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.58)