ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence