On Robustness of Vision-Language-Action Model against Multi-Modal Perturbations

Open in new window