DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models

May-30-2025, 17:18:48 GMT–Neural Information Processing Systems

Large Language Models (LLMs) have achieved remarkable success in various natural language processing tasks, including language modeling, understanding, and generation. However, the increased memory and computational costs associated with these models pose significant challenges for deployment on resource-limited devices. Structural pruning has emerged as a promising solution to reduce the costs of LLMs without requiring post-processing steps. Prior structural pruning methods either follow the dependence of structures at the cost of limiting flexibility, or introduce non-trivial additional parameters by incorporating different projection matrices. In this work, we propose a novel approach that relaxes the constraint imposed by regular structural pruning methods and eliminates the structural dependence along the embedding dimension.
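To make "eliminating the structural dependence along the embedding dimension" concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a toy residual layer of the form x + W x, and the names `pruned_residual_layer`, `masked_dense_layer`, `in_idx`, and `out_idx` are hypothetical. Each layer reads its own subset of embedding dimensions and writes its result back to its own subset, so no single index set has to be shared across layers, and no extra projection matrices are introduced.

```python
import numpy as np


def pruned_residual_layer(x, weight, in_idx, out_idx):
    """Toy residual layer that uses only a sub-block of `weight`.

    Assumption for illustration: the layer computes x + W x, and each layer
    may choose its own input and output subsets of the embedding dimension.
    """
    # Keep only the selected rows (outputs) and columns (inputs).
    small_weight = weight[np.ix_(out_idx, in_idx)]
    # Read only the embedding dimensions this layer keeps.
    update = small_weight @ x[in_idx]
    # Write the result back into the full-width residual stream.
    y = x.copy()
    y[out_idx] += update  # indices are unique, so in-place add is safe
    return y


def masked_dense_layer(x, weight, in_idx, out_idx):
    """Reference: the same computation with full-size weights and 0/1 masks."""
    d = x.shape[0]
    in_mask = np.zeros(d)
    in_mask[in_idx] = 1.0
    out_mask = np.zeros(d)
    out_mask[out_idx] = 1.0
    return x + out_mask * (weight @ (in_mask * x))


rng = np.random.default_rng(0)
d = 8
x = rng.normal(size=d)
w1 = rng.normal(size=(d, d))
w2 = rng.normal(size=(d, d))

# Each layer keeps a different subset of the embedding dimension.
layer1 = (np.array([0, 2, 5]), np.array([1, 3, 4, 6]))
layer2 = (np.array([1, 4, 6, 7]), np.array([0, 2]))

h = pruned_residual_layer(x, w1, *layer1)
out = pruned_residual_layer(h, w2, *layer2)

h_ref = masked_dense_layer(x, w1, *layer1)
out_ref = masked_dense_layer(h_ref, w2, *layer2)
print(np.allclose(out, out_ref))
```

In a dependence-following scheme, by contrast, `in_idx` and `out_idx` would have to be the same index set for every layer attached to the residual stream, which is the flexibility limit the abstract refers to.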

large language model, machine learning, natural language

Genre:
- Research Report
  - Experimental Study (0.93)
  - Promising Solution (0.86)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Large Language Model (1.00)
