AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
