S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models

Open in new window