Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models

May-27-2025, 14:59:32 GMT–Neural Information Processing Systems

While image-to-text models have demonstrated significant advancements in various vision-language tasks, they remain susceptible to adversarial attacks. Existing white-box attacks on image-to-text models require access to the architecture, gradients, and parameters of the target model, resulting in low practicality. Although the recently proposed gray-box attacks have improved practicality, they suffer from semantic loss during the training process, which limits their targeted attack performance. To advance adversarial attacks of image-to-text models, this paper focuses on a challenging scenario: decision-based black-box targeted attacks where the attackers only have access to the final output text and aim to perform targeted attacks. Specifically, we formulate the decision-based black-box targeted attack as a large-scale optimization problem.

attack, effective decision-based black-box targeted attack, textit, (8 more...)

Neural Information Processing Systems

May-27-2025, 14:59:32 GMT

Conferences Web Page

Add feedback

Industry:
- Transportation > Air (0.88)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.81)