RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought

Open in new window