ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence

Oct-9-2025, 23:44:21 GMT – Neural Information Processing Systems

... GPT-4o, on this dataset and find that LLMs are susceptible to adopting incorrect retrieved content, overriding their own correct prior knowledge over 60% of the time. However, the more unrealistic the retrieved content is (i.e., more deviated from ...

dataset, information, probability

Country:
- North America
  - Puerto Rico (0.04)
  - Canada (0.04)
  - United States
    - Wisconsin (0.04)
    - Minnesota (0.04)
    - Massachusetts (0.04)
    - California > Santa Clara County
      - Stanford (0.04)
      - Palo Alto (0.04)
- Asia
  - Middle East > Iran (0.04)
  - Japan > Honshū
    - Tōhoku > Iwate Prefecture > Morioka (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Leisure & Entertainment > Sports (0.68)
- Health & Medicine (0.46)
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
