Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling
