Visual CoT Makes VLMs Smarter but More Fragile

Open in new window