What Evidence Do Language Models Find Convincing?