SUBP: Soft Uniform Block Pruning for 1 \times N Sparse CNNs Multithreading Acceleration

Dec-26-2025, 11:51:36 GMT–Neural Information Processing Systems

The study of sparsity in Convolutional Neural Networks (CNNs) has become widespread to compress and accelerate models in environments with limited resources. By constraining N consecutive weights along the output channel to be group-wise non-zero, the recent network with 1$\times$N sparsity has received tremendous popularity for its three outstanding advantages: 1) A large amount of storage space saving by a \emph{Block Sparse Row} matrix.

name change, soft uniform block pruning, sparse cnn multithreading acceleration, (7 more...)

Neural Information Processing Systems

Dec-26-2025, 11:51:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)