C2F-Space: Coarse-to-Fine Space Grounding for Spatial Instructions using Vision-Language Models
