Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters

Neural Information Processing Systems 

However, although these techniques have been widely adopted in single-task scenarios, research is scarce in multi-task scenarios.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found