Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust
