Evaluating Superhuman Models with Consistency Checks
