TERAG: Token-Efficient Graph-Based Retrieval-Augmented Generation
Xiao, Qiao, Tsang, Hong Ting, Bai, Jiaxin
–arXiv.org Artificial Intelligence
Graph-based Retrieval-augmented generation (RAG) has become a widely studied approach for improving the reasoning, accuracy, and factuality of Large Language Models (LLMs). However, many existing graph-based RAG systems overlook the high cost associated with LLM token usage during graph construction, hindering large-scale adoption. To address this, we propose TERAG, a simple yet effective framework designed to build informative graphs at a significantly lower cost. Inspired by HippoRAG, we incorporate Personalized PageRank (PPR) during the retrieval phase, and we achieve at least 80% of the accuracy of widely used graph-based RAG methods while consuming only 3%-11% of the output tokens. With its low token footprint and efficient construction pipeline, TERAG is well-suited for large-scale and cost-sensitive deployment scenarios.
arXiv.org Artificial Intelligence
Nov-11-2025
- Country:
- Asia > China
- Hong Kong (0.04)
- North America > United States
- Florida
- Hillsborough County > University (0.04)
- Miami-Dade County > Miami (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- New York > New York County
- New York City (0.04)
- Florida
- Asia > China
- Genre:
- Research Report (0.85)
- Technology: