ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence
–Neural Information Processing Systems
GPT -4o, on this dataset and find that LLMs are susceptible to adopting incorrect retrieved content, overriding their own correct prior knowledge over 60% of the time. However, the more unrealistic the retrieved content is (i.e. more deviated from
Neural Information Processing Systems
Feb-11-2026, 10:26:56 GMT
- Country:
- North America
- Puerto Rico (0.04)
- Canada (0.04)
- United States
- Wisconsin (0.04)
- Minnesota (0.04)
- Massachusetts (0.04)
- California > Santa Clara County
- Asia
- Middle East > Iran (0.04)
- Japan > Honshū
- Tōhoku > Iwate Prefecture > Morioka (0.04)
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Leisure & Entertainment > Sports (0.68)
- Health & Medicine (0.46)
- Government (0.46)
- Technology: