RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Open in new window