LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment

Open in new window