Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Open in new window