Frustratingly Easy Task-aware Pruning for Large Language Models

Open in new window