Replace-then-Perturb: Targeted Adversarial Attacks With Visual Reasoning for Vision-Language Models
