ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning

Open in new window