SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Open in new window