Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models
