RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
Jiang, Pengcheng, Cao, Lang, Zhu, Ruike, Jiang, Minhao, Zhang, Yunyi, Sun, Jimeng, Han, Jiawei
–arXiv.org Artificial Intelligence
Retrieval-augmented language models often struggle with knowledge-intensive tasks due to inefficient retrieval, unstructured knowledge integration, and single-pass architectures. We present Retrieval-And-Structuring (RAS), a novel framework that dynamically constructs and reasons over query-specific knowledge graphs through iterative retrieval and structuring. RAS introduces four key technical innovations: (1) a themescoped retrieval mechanism that efficiently narrows the search space while maintaining retrieval quality, (2) an action planning module that determines knowledge needs and generates focused sub-queries, (3) a dynamic knowledge structuring approach that converts retrieved text into an evolving knowledge graph, and (4) a graph-augmented answering component that leverages the accumulated structured information. Our framework achieves state-of-the-art performance, surpassing leading baselines by 6.4% with open-source language models and 7.0% with proprietary models on seven knowledge-intensive generation datasets across all evaluation metrics. Detailed ablation studies verify the contribution of each technical component to the overall system performance.
arXiv.org Artificial Intelligence
Feb-16-2025
- Country:
- North America > United States (1.00)
- Genre:
- Personal > Honors (0.67)
- Research Report (0.64)
- Industry:
- Banking & Finance > Trading (0.67)
- Leisure & Entertainment > Sports
- Olympic Games (0.46)
- Media > Music (0.93)
- Technology: