Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment Wei Li1, Mark Lee 1

Jun-1-2025, 15:06:01 GMT–Neural Information Processing Systems

Large Language Models (LLMs) have revolutionized the field of natural language processing with their impressive capabilities. However, their enormous size presents challenges for deploying them in real-world applications. Traditional compression techniques, like pruning, often lead to suboptimal performance due to their uniform pruning ratios and lack of consideration for the varying importance of features across different layers. To address these limitations, we present a novel Adaptive Layer Sparsity (ALS) approach to optimize LLMs. Our approach consists of two key steps.

large language model, machine learning, sparsity, (20 more...)

Neural Information Processing Systems

Jun-1-2025, 15:06:01 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Research Report > Experimental Study (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Large Language Model (1.00)