LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
Ma, Guangyuan, Ma, Yongliang, Gou, Xuanrui, Su, Zhenpeng, Zhou, Ming, Hu, Songlin
–arXiv.org Artificial Intelligence
Large Language Models (LLMs)-based text retrieval retrieves documents relevant to search queries based on vector similarities. Documents are pre-encoded offline, while queries arrive in real-time, necessitating an efficient online query encoder. Although LLMs significantly enhance retrieval capabilities, serving deeply parameterized LLMs slows down query inference throughput and increases demands for online deployment resources. In this paper, we propose LightRetriever, a novel LLM-based retriever with extremely lightweight query encoders. Our method retains a full-sized LLM for document encoding, but reduces the workload of query encoding to no more than an embedding lookup. Compared to serving a full LLM on an A800 GPU, our method achieves over 1000x speedup in query encoding and over 10x increase in end-to-end retrieval throughput. Extensive experiments on large-scale retrieval benchmarks show that LightRetriever generalizes well across diverse tasks, maintaining an average of 95% retrieval performance.
arXiv.org Artificial Intelligence
Sep-23-2025
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- Europe
- Austria > Vienna (0.14)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Germany (0.04)
- Italy
- Molise > Campobasso Province
- Campobasso (0.04)
- Tuscany > Florence (0.04)
- Molise > Campobasso Province
- Portugal > Lisbon
- Lisbon (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- British Columbia (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Dominican Republic (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Montgomery County
- Gaithersburg (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Oregon
- Benton County > Corvallis (0.04)
- Multnomah County > Portland (0.04)
- California > Los Angeles County
- Canada
- Oceania > New Zealand
- South Island > Otago > Dunedin (0.04)
- Africa > Ethiopia
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Health & Medicine (0.68)
- Information Technology (0.92)
- Technology: