Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction

Open in new window