Preserving LLMCapabilities through Calibration Data Curation: From Analysis to Optimization

Jun-17-2026, 08:57:50 GMT–Neural Information Processing Systems

Post-training compression has been a widely employed approach to scale down large language model (LLM) and facilitate efficient inference. In various proposed compression methods, including pruning and quantization, calibration data plays a vital role by informing the weight importance and activation dynamic ranges. However, how calibration data impacts the LLM capability after compression is less explored. Few of the existing works, though recognizing the significance of this study, only investigate the language modeling or commonsense reasoning performance degradation from limited angles, like the data sources or sample amounts. More systematic research is still needed to examine the impacts on different LLM capabilities in terms of compositional properties and domain correspondence of calibration data.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Jun-17-2026, 08:57:50 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.67)
- North America > United States
  - Minnesota (0.27)
- Europe > United Kingdom
  - England (0.27)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (0.67)
- Government (0.67)

Technology:
- Information Technology
  - Data Science > Data Quality (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found