Learning Black-Box Attackers with Transferable Priors and Query Feedback

Oct-10-2024, 18:43:32 GMT–Neural Information Processing Systems

This paper addresses the challenging black-box adversarial attack problem, where only classification confidence of a victim model is available. Inspired by consistency of visual saliency between different vision models, a surrogate model is expected to improve the attack performance via transferability. By combining transferability-based and query-based black-box attack, we propose a surprisingly simple baseline approach (named SimBA) using the surrogate model, which significantly outperforms several state-of-the-art methods. Moreover, to efficiently utilize the query feedback, we update the surrogate model in a novel learning scheme, named High-Order Gradient Approximation (HOGA). By constructing a high-order gradient computation graph, we update the surrogate model to approximate the victim model in both forward and backward pass.

learning black-box attacker, query feedback, surrogate model, (2 more...)

Neural Information Processing Systems

Oct-10-2024, 18:43:32 GMT

Conferences Web Page

Add feedback

Industry:
- Transportation > Air (0.95)

Technology:
- Information Technology > Artificial Intelligence (0.43)