ZipLM: Inference-Aware Structured Pruning of Language Models