Source Coverage and Citation Bias in LLM-based vs. Traditional Search Engines

Zhang, Peixian, Ye, Qiming, Peng, Zifan, Garimella, Kiran, Tyson, Gareth

Dec-11-2025–arXiv.org Artificial Intelligence

LLM-based Search Engines (LLM-SEs) introduces a new paradigm for information seeking. Unlike Traditional Search Engines (TSEs) (e.g., Google), these systems summarize results, often providing limited citation transparency. The implications of this shift remain largely unexplored, yet raises key questions regarding trust and transparency. In this paper, we present a large-scale empirical study of LLM-SEs, analyzing 55,936 queries and the corresponding search results across six LLM-SEs and two TSEs. We confirm that LLM-SEs cites domain resources with greater diversity than TSEs. Indeed, 37% of domains are unique to LLM-SEs. However, certain risks still persist: LLM-SEs do not outperform TSEs in credibility, political neutrality and safety metrics. Finally, to understand the selection criteria of LLM-SEs, we perform a feature-based analysis to identify key factors influencing source choice. Our findings provide actionable insights for end users, website owners, and developers.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Dec-11-2025

arXiv.org PDF

Add feedback

Country:
- Asia
  - China
    - Guangdong Province > Guangzhou (0.05)
    - Hong Kong (0.40)
  - Indonesia > Bali (0.04)
  - Singapore (0.04)
- Europe > Belgium
  - Brussels-Capital Region > Brussels (0.04)
- North America > United States
  - New Jersey > Middlesex County
    - New Brunswick (0.04)
  - New York > New York County
    - New York City (0.04)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (1.00)

Industry:
- Government > Regional Government (0.67)
- Information Technology > Security & Privacy (1.00)
- Law Enforcement & Public Safety (0.67)
- Leisure & Entertainment (0.67)
- Media > News (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
  - Natural Language > Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found