Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion
Yejun Yoon, Jaeyoon Jung, Seunghyun Yoon, Kunwoo Park
arXiv.org Artificial Intelligence
Query expansion methods powered by large language models (LLMs) have demonstrated effectiveness in zero-shot retrieval tasks. These methods assume that LLMs can generate hypothetical documents that, when incorporated into a query vector, enhance the retrieval of real evidence. However, we challenge this assumption by investigating whether knowledge leakage in benchmarks contributes to the observed performance gains. Using fact verification as a testbed, we analyze whether the generated documents contain information entailed by ground-truth evidence and assess their impact on performance. Our findings indicate that, on average, performance improvements consistently occurred for claims whose generated documents included sentences entailed by gold evidence. This suggests that knowledge leakage may be present in fact-verification benchmarks, potentially inflating the perceived performance of LLM-based query expansion methods.
Jun-5-2025
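The query expansion setup the abstract critiques can be illustrated with a minimal sketch. In HyDE-style expansion, the embedding of the original query is averaged with embeddings of LLM-generated hypothetical documents, and the resulting vector is used to rank real documents. The `embed` function below is a toy bag-of-words stand-in for a dense sentence encoder; all names are illustrative, not the paper's implementation.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding. A real system would use a dense
    sentence encoder (e.g. a dual-encoder retriever)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def expanded_query_vector(query, hypothetical_docs):
    """HyDE-style expansion: average the query embedding with the
    embeddings of LLM-generated hypothetical documents."""
    vecs = [embed(query)] + [embed(d) for d in hypothetical_docs]
    merged = Counter()
    for v in vecs:
        merged.update(v)
    n = len(vecs)
    return Counter({term: count / n for term, count in merged.items()})


def rank(corpus, query_vec):
    """Rank corpus documents by similarity to the expanded query vector."""
    return sorted(corpus, key=lambda d: cosine(query_vec, embed(d)),
                  reverse=True)
```

The leakage concern is that when the hypothetical document paraphrases the gold evidence (as the LLM may do for claims it memorized), the expanded vector trivially matches that evidence, inflating measured retrieval gains.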