Chain of Thought Prompt Tuning in Vision Language Models
