Visual S: Sketching as a Visual Chain of Thought for Multimodal Language Models

Open in new window