Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models