Loquetier: AVirtualized Multi-LoRA Framework for Unified LLMFine-tuning and Serving
–Neural Information Processing Systems
Low-Rank Adaptation (LoRA) has become a widely adopted parameter-efficient fine-tuning (PEFT) technique for adapting large language models (LLMs) to downstream tasks. While prior work has explored strategies for integrating LLM training and serving, there still remains a gap in unifying fine-tuning and inference for LoRA-based models.
Neural Information Processing Systems
Jun-17-2026, 15:59:07 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology (0.46)
- Education (0.46)
- Technology: