SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs
