The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems

Open in new window