SpaRE: Enhancing Spatial Reasoning in Vision-Language Models with Synthetic Data

Open in new window