MoS-VLA: A Vision-Language-Action Model with One-Shot Skill Adaptation

Open in new window