Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
