CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
