The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate

Open in new window