Improving Black-box Adversarial Attacks with a Transfer-based Prior

Cheng, Shuyu, Dong, Yinpeng, Pang, Tianyu, Su, Hang, Zhu, Jun

Jun-17-2019–arXiv.org Machine Learning

We consider the black-box adversarial setting, where the adversary has to generate adversarial perturbations without access to the target models to compute gradients. Previous methods tried to approximate the gradient either by using a transfer gradient of a surrogate white-box model, or based on the query feedback. However, these methods often suffer from low attack success rates or poor query efficiency since it is non-trivial to estimate the gradient in a high-dimensional space with limited information. To address these problems, we propose a prior-guided random gradient-free (P-RGF) method to improve black-box adversarial attacks, which takes the advantage of a transfer-based prior and the query information simultaneously. The transfer-based prior given by the gradient of a surrogate model is appropriately integrated into our algorithm by an optimal coefficient derived by a theoretical analysis. Extensive experiments demonstrate that our method requires much fewer queries to attack black-box models with higher success rates compared with the alternative state-of-the-art methods.

artificial intelligence, machine learning, optimization problem, (18 more...)

arXiv.org Machine Learning

Jun-17-2019

arXiv.org PDF

Add feedback

Country:
- Asia (0.46)

Genre:
- Research Report (1.00)

Industry:
- Transportation > Air (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Military (0.71)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Optimization (0.68)
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found