Reliable Fine-Grained Evaluation of Natural Language Math Proofs

Open in new window