Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
