GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning
