AutoEval Done Right: Using Synthetic Data for Model Evaluation
