Spatial Reasoning in Multimodal Large Language Models: A Survey of Tasks, Benchmarks and Methods

Open in new window