3BASiL: An Algorithmic Framework for Sparseplus Low-Rank Compression of LLMs
–Neural Information Processing Systems
Sparse plus Low-Rank (S + LR) decomposition of Large Language Models (LLMs) has emerged as a promising direction in model compression, aiming to decompose pre-trained model weights into a sum of sparse and low-rank matrices W S + LR. Despite recent progress, existing methods often suffer from substantial performance degradation compared to dense models. In this work, we introduce 3BASiL-TM, an efficient one-shot post-training method for (S + LR) decomposition of LLMs that addresses this gap. Our approach first introduces a novel 3-Block Alternating Direction Method of Multipliers (ADMM) method, termed 3BASiL, to minimize the layer-wise reconstruction error with convergence guarantees.
Neural Information Processing Systems
Jun-23-2026, 04:21:25 GMT
- Country:
- North America > United States > Minnesota (0.27)
- Genre:
- Research Report > Experimental Study (1.00)
- Technology: