Digital Socrates: Evaluating LLMs through explanation critiques