CLAP: Coreference-Linked Augmentation for Passage Retrieval
Xu, Huanwei; Xu, Lin; Yuan, Liang
arXiv.org Artificial Intelligence
Large Language Model (LLM)-based passage expansion has shown promise for enhancing first-stage retrieval, but it often underperforms with dense retrievers because of semantic drift and misalignment with their pretrained semantic space. In addition, only a portion of a passage is typically relevant to a query, while the rest introduces noise, an issue compounded by chunking techniques that break coreference continuity. We propose Coreference-Linked Augmentation for Passage Retrieval (CLAP), a lightweight LLM-based expansion framework that segments passages into coherent chunks, resolves coreference chains, and generates localized pseudo-queries aligned with dense retriever representations. A simple fusion of global topical signals and fine-grained subtopic signals achieves robust performance across domains. CLAP yields consistent gains even as retriever strength increases, enabling dense retrievers to match or surpass second-stage rankers such as BM25 + MonoT5-3B, with up to 20.68% absolute nDCG@10 improvement. These improvements are especially notable in out-of-domain settings, where conventional LLM-based expansion methods that rely on domain knowledge often falter. CLAP instead adopts a logic-centric pipeline that enables robust, domain-agnostic generalization.
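The abstract does not state how the global and fine-grained signals are fused, so the following is only a minimal sketch of one plausible reading: a weighted sum of a passage-level similarity and the best chunk-level (pseudo-query) similarity. The function names, the `alpha` weight, the max-pooling over chunks, and the use of cosine similarity are all assumptions made for illustration, not the authors' method. The vectors stand in for embeddings that a dense retriever would produce.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def fused_score(query_vec, passage_vec, chunk_vecs, alpha=0.5):
    """Hypothetical fusion: alpha * global score + (1 - alpha) * best local score.

    query_vec   -- embedding of the query
    passage_vec -- embedding of the whole passage (global topical signal)
    chunk_vecs  -- embeddings of per-chunk pseudo-queries (fine-grained signals)
    alpha       -- assumed mixing weight; not taken from the paper
    """
    global_score = cosine(query_vec, passage_vec)
    # Max-pooling over chunks is an assumption; default covers an empty chunk list.
    local_score = max((cosine(query_vec, c) for c in chunk_vecs), default=0.0)
    return alpha * global_score + (1.0 - alpha) * local_score


if __name__ == "__main__":
    query = [1.0, 0.0]
    passage = [1.0, 1.0]                # only partly aligned with the query
    chunks = [[1.0, 0.0], [0.0, 1.0]]   # one chunk matches the query exactly
    print(fused_score(query, passage, chunks))
```

In this toy case the passage as a whole is only partly aligned with the query, while one chunk matches it exactly, so the fused score is higher than the global score alone. That is the intuition behind combining topical and subtopic signals, under the assumptions stated above.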
Aug-26-2025
- Country:
  - Asia
    - Indonesia (0.04)
    - Middle East > Jordan (0.04)
    - South Korea > Seoul
      - Seoul (0.05)
    - Thailand > Bangkok
      - Bangkok (0.04)
    - Vietnam (0.04)
  - Europe
    - France (0.04)
    - Middle East > Malta
      - Eastern Region > Northern Harbour District > St. Julian's (0.04)
  - North America
    - Canada
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.14)
      - Ontario > Toronto (0.04)
    - The Bahamas (0.14)
    - United States
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Michigan > Washtenaw County
        - Ann Arbor (0.04)
      - New York > New York County
        - New York City (0.04)
  - Oceania > Australia
    - Queensland (0.04)
    - South Australia > Adelaide (0.04)
  - South America
- Genre:
  - Research Report (0.82)
- Technology: