DoTA-RAG: Dynamic of Thought Aggregation RAG
Ruangtanusak, Saksorn, Rungseesiripak, Natthapath, Rojratchadakorn, Peerawat, Charattrakool, Monthol, Nitarach, Natapong
–arXiv.org Artificial Intelligence
In this paper, we introduce DoTA-RAG (Dynamic-of-Thought Aggregation RAG), a retrieval-augmented generation system optimized for high-throughput, large-scale web knowledge indexes. Traditional RAG pipelines often suffer from high latency and limited accuracy over massive, diverse datasets. DoTA-RAG addresses these challenges with a three-stage pipeline: query rewriting, dynamic routing to specialized sub-indexes, and multi-stage retrieval and ranking. We further enhance retrieval by evaluating and selecting a superior embedding model, re-embedding the large FineWeb-10BT corpus. Moreover, we create a diverse Q&A dataset of 500 questions generated via the DataMorgana setup across a broad range of WebOrganizer topics and formats. DoTA-RAG improves the answer correctness score from 0.752 (baseline, using LiveRAG pre-built vector store) to 1.478 while maintaining low latency, and it achieves a 0.929 correctness score on the Live Challenge Day. These results highlight DoTA-RAG's potential for practical deployment in domains requiring fast, reliable access to large and evolving knowledge sources.
arXiv.org Artificial Intelligence
Jun-17-2025
- Country:
- Asia
- Japan (0.04)
- Middle East > Jordan (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Thailand > Bangkok
- Bangkok (0.05)
- Europe
- North America
- Canada > Ontario
- Toronto (0.04)
- United States (0.04)
- Canada > Ontario
- Asia
- Genre:
- Personal (0.68)
- Research Report (0.66)
- Industry:
- Banking & Finance (0.46)
- Health & Medicine (0.46)
- Technology: