AITopics | magnitude pruning

Collaborating Authors

magnitude pruning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Controlled Sparsity via Constrained Optimization or: How ILearned to Stop Tuning Penalties and Love Constraints

Neural Information Processing SystemsApr-24-2026, 10:51:30 GMT

The performance of trained neural networks is robust to harsh levels of pruning. Coupled with the ever-growing size of deep learning models, this observation has motivated extensive research on learning sparse models. In this work, we focus on the task of controlling the level of sparsity when performing sparse learning. Existing methods based on sparsity-inducing penalties involve expensive trial-anderror tuning of the penalty factor, thus lacking direct control of the resulting model sparsity. In response, we adopt a constrained formulation: using the gate mechanism proposed by Louizos et al. [31], we formulate a constrained optimization problem where sparsification is guided by the training objective and the desired sparsity target in an end-to-end fashion. Experiments on CIFAR-{10, 100}, TinyImageNet, and ImageNet using WideResNet and ResNet{18, 50} models validate the effectiveness of our proposal and demonstrate that we can reliably achieve pre-determined sparsity targets without compromising on predictive performance.

artificial intelligence, constraint, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > Canada (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

f5c3dd7514bf620a1b85450d2ae374b1-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 22:48:56 GMT

generalization, neural network, pruning, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Europe > Switzerland (0.04)

Genre: Research Report > New Finding (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

eae15aabaa768ae4a5993a8a4f4fa6e4-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 23:16:40 GMT

magnitude pruning, movement pruning, pruning, (13 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Genre: Research Report (0.68)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

eae15aabaa768ae4a5993a8a4f4fa6e4-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-10-2026, 23:16:28 GMT

artificial intelligence, machine learning, pruning, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Add feedback

WoodFisher: EfficientSecond-OrderApproximation forNeuralNetworkCompression

Neural Information Processing SystemsFeb-10-2026, 12:43:57 GMT

Recently, there has been significant interest in utilizing this information in the context of deep neural networks; however,relatively little isknown about the quality ofexisting approximationsinthiscontext.

artificial intelligence, machine learning, pruning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Indiana > Hamilton County > Fishers (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Switzerland (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

089b592cccfafdca8e0178e85b609f19-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 08:36:36 GMT

In this work, we focus on the task of controlling the level of sparsity when performing sparse learning.

artificial intelligence, cit, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada > Quebec (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

Sparse Training via Boosting Pruning Plasticity with Neuroregeneration

Neural Information Processing SystemsDec-24-2025, 03:06:51 GMT

Works on lottery ticket hypothesis (LTH) and single-shot network pruning (SNIP) have raised a lot of attention currently on post-training pruning (iterative magnitude pruning), and before-training pruning (pruning at initialization). The former method suffers from an extremely large computation cost and the latter usually struggles with insufficient performance. In comparison, during-training pruning, a class of pruning methods that simultaneously enjoys the training/inference efficiency and the comparable performance, temporarily, has been less explored. To better understand during-training pruning, we quantitatively study the effect of pruning throughout training from the perspective of pruning plasticity (the ability of the pruned networks to recover the original performance). Pruning plasticity can help explain several other empirical observations about neural network pruning in literature. We further find that pruning plasticity can be substantially improved by injecting a brain-inspired mechanism called neuroregeneration, i.e., to regenerate the same number of connections as pruned. We design a novel gradual magnitude pruning (GMP) method, named gradual pruning with zero-cost neuroregeneration (GraNet), that advances state of the art. Perhaps most impressively, its sparse-to-sparse version for the first time boosts the sparse-to-sparse training performance over various dense-to-sparse methods with ResNet-50 on ImageNet without extending the training time.

pruning, pruning plasticity, sparse training, (9 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Gambling (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

CoDeQ: End-to-End Joint Model Compression with Dead-Zone Quantizer for High-Sparsity and Low-Precision Networks

Wenshøj, Jonathan, Chen, Tong, Pepin, Bob, Selvan, Raghavendra

arXiv.org Machine LearningDec-16-2025

While joint pruning--quantization is theoretically superior to sequential application, current joint methods rely on auxiliary procedures outside the training loop for finding compression parameters. This reliance adds engineering complexity and hyperparameter tuning, while also lacking a direct data-driven gradient signal, which might result in sub-optimal compression. In this paper, we introduce CoDeQ, a simple, fully differentiable method for joint pruning--quantization. Our approach builds on a key observation: the dead-zone of a scalar quantizer is equivalent to magnitude pruning, and can be used to induce sparsity directly within the quantization operator. Concretely, we parameterize the dead-zone width and learn it via backpropagation, alongside the quantization parameters. This design provides explicit control of sparsity, regularized by a single global hyperparameter, while decoupling sparsity selection from bit-width selection. The result is a method for Compression with Dead-zone Quantizer (CoDeQ) that supports both fixed-precision and mixed-precision quantization (controlled by an optional second hyperparameter). It simultaneously determines the sparsity pattern and quantization parameters in a single end-to-end optimization. Consequently, CoDeQ does not require any auxiliary procedures, making the method architecture-agnostic and straightforward to implement. On ImageNet with ResNet-18, CoDeQ reduces bit operations to ~5% while maintaining close to full precision accuracy in both fixed and mixed-precision regimes.

codeq, quantization, quantizer, (14 more...)

arXiv.org Machine Learning

2512.12981

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Denmark > Capital Region > Copenhagen (0.05)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR

Irigoyen, Julian, Söhler, Arthur, Kirkedal, Andreas Søeborg

arXiv.org Artificial IntelligenceNov-12-2025

We challenge the conventional view of neural network pruning as solely a compression technique, demonstrating that one-shot magnitude pruning serves as a powerful implicit regularizer for ASR. Using Whisper-small, we combine gradient- and Fisher-based sensitivity diagnostics with targeted, component-wise pruning. This reveals architectural asymmetries: decoder FFNs are pruning-fragile, whereas decoder self-attention and the last encoder layers contain redundancy that, when removed, improves generalization. Without fine-tuning, pruning 50% of decoder self-attention reduces WER by 2.38% absolute (20.44% relative) on LibriSpeech test-other; pruning the last four encoder layers at 50% instead yields a 1.72% absolute (14.8% relative) improvement. Gains persisted on Common Voice and TED-LIUM datasets. Beyond regularization benefits, our sensitivity-aware approach enables more aggressive one-shot compression. At 40% sparsity, where established global pruning approaches catastrophically fail, our method preserves near-baseline accuracy. This positions pruning as a first-class architectural design tool: knowing where to prune is as important as how much to prune.

artificial intelligence, machine learning, pruning, (17 more...)

arXiv.org Artificial Intelligence

2511.08092

Country: Europe > Denmark (0.15)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

Filters

Collaborating Authors

magnitude pruning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Controlled Sparsity via Constrained Optimization or: How ILearned to Stop Tuning Penalties and Love Constraints

f5c3dd7514bf620a1b85450d2ae374b1-Supplemental.pdf

f5c3dd7514bf620a1b85450d2ae374b1-Paper.pdf

eae15aabaa768ae4a5993a8a4f4fa6e4-Paper.pdf

eae15aabaa768ae4a5993a8a4f4fa6e4-AuthorFeedback.pdf

WoodFisher: EfficientSecond-OrderApproximation forNeuralNetworkCompression

089b592cccfafdca8e0178e85b609f19-Supplemental-Conference.pdf

Sparse Training via Boosting Pruning Plasticity with Neuroregeneration

CoDeQ: End-to-End Joint Model Compression with Dead-Zone Quantizer for High-Sparsity and Low-Precision Networks

Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR