ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence Eric Wu* Department of Biomedical Data Science Department of Electrical Engineering Stanford University

Open in new window