Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

Open in new window