FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

Open in new window