Evaluating LLMs at Detecting Errors in LLM Responses

Open in new window