JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models

Open in new window