From Local to Global: Revisiting Structured Pruning Paradigms for Large Language Models
