Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training

Dec-24-2025, 12:46:42 GMT–Neural Information Processing Systems

Recently, sparse training has emerged as a promising paradigm for efficient deep learning on edge devices. The current research mainly devotes the efforts to reducing training costs by further increasing model sparsity. However, increasing sparsity is not always ideal since it will inevitably introduce severe accuracy degradation at an extremely high sparsity level. This paper intends to explore other possible directions to effectively and efficiently reduce sparse training costs while preserving accuracy. To this end, we investigate two techniques, namely, layer freezing and data sieving. First, the layer freezing approach has shown its success in dense model training and fine-tuning, yet it has never been adopted in the sparse training domain.

generic framework, layer freezing & data sieving, training cost, (7 more...)

Neural Information Processing Systems

Dec-24-2025, 12:46:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.76)