ZipLM: Inference-Aware Structured Pruning of Language Models

Open in new window