JudgeBench: A Benchmark for Evaluating LLM-based Judges

Open in new window