LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
