Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models, and Min Jiang
–Neural Information Processing Systems
While image-to-text models have demonstrated significant advancements in various vision-language tasks, they remain susceptible to adversarial attacks. Existing white-box attacks on image-to-text models require access to the architecture, gradients, and parameters of the target model, resulting in low practicality. Although the recently proposed gray-box attacks have improved practicality, they suffer from semantic loss during the training process, which limits their targeted attack performance. To advance adversarial attacks of image-to-text models, this paper focuses on a challenging scenario: decision-based black-box targeted attacks where the attackers only have access to the final output text and aim to perform targeted attacks. Specifically, we formulate the decision-based black-box targeted attack as a large-scale optimization problem.
Neural Information Processing Systems
Mar-27-2025, 06:20:56 GMT
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.67)
- Research Report
- Industry:
- Government (1.00)
- Information Technology > Security & Privacy (0.69)
- Transportation > Air (0.92)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Evolutionary Systems (1.00)
- Neural Networks > Deep Learning (1.00)
- Natural Language > Text Processing (0.95)
- Representation & Reasoning > Optimization (0.88)
- Vision (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence