One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
