MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Open in new window