What Matters In The Structured Pruning of Generative Language Models?
