Data-Efficient Structured Pruning via Submodular Optimization

Jan-19-2025, 06:02:05 GMT–Neural Information Processing Systems

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods provide no performance guarantees and often require fine-tuning, which makes them inapplicable in the limited-data regime. We propose a principled, data-efficient structured pruning method based on submodular optimization. In particular, for a given layer, we select the neurons/channels to prune, together with corresponding new weights for the next layer, so as to minimize the change in the next layer's input induced by pruning. We show that this selection problem is a weakly submodular maximization problem, so it can be provably approximated using an efficient greedy algorithm.
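The selection step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the "change in the next layer's input" is measured as a squared Frobenius error over a small batch of activations, and that the new next-layer weights are obtained by an ordinary least-squares refit. The names `reconstruction_error`, `greedy_select`, `A` (activations, samples by neurons), and `W` (next-layer weights, neurons by outputs) are hypothetical and chosen for illustration only.

```python
import numpy as np


def reconstruction_error(A, W, kept):
    """Squared Frobenius change in the next layer's input when only the
    neurons in `kept` survive and the next-layer weights are refit.

    Assumption: the refit is a plain least-squares solve, which may differ
    from the procedure used in the paper.
    """
    target = A @ W  # next layer's input before pruning
    if not kept:
        return float(np.sum(target ** 2))
    A_kept = A[:, kept]
    W_new, *_ = np.linalg.lstsq(A_kept, target, rcond=None)
    return float(np.sum((target - A_kept @ W_new) ** 2))


def greedy_select(A, W, k):
    """Greedily pick k neurons to keep; all other neurons are pruned.

    Each step adds the neuron that lowers the reconstruction error the most,
    which is the same as maximizing the reduction in error.
    """
    kept = []
    remaining = list(range(A.shape[1]))
    for _ in range(k):
        best = min(
            remaining,
            key=lambda j: reconstruction_error(A, W, kept + [j]),
        )
        kept.append(best)
        remaining.remove(best)
    # Refit the next layer's weights on the surviving neurons.
    W_new, *_ = np.linalg.lstsq(A[:, kept], A @ W, rcond=None)
    return kept, W_new
```

Calling `greedy_select(A, W, k)` returns the indices of the neurons to keep and a refit weight matrix of shape `(k, W.shape[1])`. The sketch recomputes a least-squares solve for every candidate at every step, which is simple to read but slow; a practical version would update the fit incrementally.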

data-efficient structured pruning, limited-data regime, submodular optimization

