Preserving LLMCapabilities through Calibration Data Curation: From Analysis to Optimization
–Neural Information Processing Systems
Post-training compression has been a widely employed approach to scale down large language model (LLM) and facilitate efficient inference. In various proposed compression methods, including pruning and quantization, calibration data plays a vital role by informing the weight importance and activation dynamic ranges. However, how calibration data impacts the LLM capability after compression is less explored. Few of the existing works, though recognizing the significance of this study, only investigate the language modeling or commonsense reasoning performance degradation from limited angles, like the data sources or sample amounts. More systematic research is still needed to examine the impacts on different LLM capabilities in terms of compositional properties and domain correspondence of calibration data.
Neural Information Processing Systems
Jun-17-2026, 08:57:50 GMT
- Country:
- Asia (0.67)
- North America > United States
- Minnesota (0.27)
- Europe > United Kingdom
- England (0.27)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Information Technology > Security & Privacy (0.67)
- Government (0.67)
- Technology: