LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks
Fleshman, William, Van Durme, Benjamin
–arXiv.org Artificial Intelligence
The proliferation of fine-tuned language model experts for specific tasks and domains signals the need for efficient selection and combination methods. We propose LoRA-Augmented Generation (LAG) for leveraging large libraries of knowledge and task-specific LoRA adapters. LAG requires no additional training or access to data, and efficiently filters, retrieves, and applies experts on a per-token and layer basis. We evaluate LAG on various knowledge-intensive tasks, achieving superior performance over existing data-free methods. We explore scenarios where additional data is available, demonstrating LAG's compatibility with alternative solutions such as retrieval-augmented generation (RAG).
arXiv.org Artificial Intelligence
Aug-19-2025
- Country:
- Asia (1.00)
- North America > United States (0.68)
- Europe (0.68)
- Genre:
- Research Report (0.64)
- Industry:
- Information Technology (0.46)
- Technology: