Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning
