RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
Jiang, Pengcheng, Cao, Lang, Zhu, Ruike, Jiang, Minhao, Zhang, Yunyi, Sun, Jimeng, Han, Jiawei
–arXiv.org Artificial Intelligence
Retrieval-augmented language models often struggle with knowledge-intensive tasks due to inefficient retrieval, unstructured knowledge integration, and single-pass architectures. We present Retrieval-And-Structuring (RAS), a novel framework that dynamically constructs and reasons over query-specific knowledge graphs through iterative retrieval and structuring. RAS introduces four key technical innovations: (1) a themescoped retrieval mechanism that efficiently narrows the search space while maintaining retrieval quality, (2) an action planning module that determines knowledge needs and generates focused sub-queries, (3) a dynamic knowledge structuring approach that converts retrieved text into an evolving knowledge graph, and (4) a graph-augmented answering component that leverages the accumulated structured information. Our framework achieves state-of-the-art performance, surpassing leading baselines by 6.4% with open-source language models and 7.0% with proprietary models on seven knowledge-intensive generation datasets across all evaluation metrics. Detailed ablation studies verify the contribution of each technical component to the overall system performance.
arXiv.org Artificial Intelligence
Feb-16-2025
- Country:
- Europe
- Middle East > Malta (0.04)
- United Kingdom > England
- Lincolnshire (0.04)
- North America > United States
- Illinois > Cook County
- Chicago (0.04)
- New York (0.04)
- Ohio (0.05)
- Illinois > Cook County
- Oceania > Australia (0.04)
- Europe
- Genre:
- Personal > Honors (0.67)
- Research Report (0.64)
- Industry:
- Banking & Finance > Trading (0.67)
- Leisure & Entertainment > Sports
- Olympic Games (0.46)
- Media > Music (0.93)
- Technology: