Query-Efficient Black-box Adversarial Examples

Ilyas, Andrew, Engstrom, Logan, Athalye, Anish, Lin, Jessy

Dec-19-2017–arXiv.org Machine Learning

Current neural network-based image classifiers are susceptible to adversarial examples, even in the black-box setting, where the attacker is limited to query access without access to gradients. Previous methods --- substitute networks and coordinate-based finite-difference methods --- are either unreliable or query-inefficient, making these methods impractical for certain problems. We introduce a new method for reliably generating adversarial examples under more restricted, practical black-box threat models. First, we apply natural evolution strategies to perform black-box attacks using two to three orders of magnitude fewer queries than previous methods. Second, we introduce a new algorithm to perform targeted adversarial attacks in the partial-information setting, where the attacker only has access to a limited number of target classes. Using these techniques, we successfully perform the first targeted adversarial attack against a commercially deployed machine learning system, the Google Cloud Vision API, in the partial information setting.

adversarial example, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

Dec-19-2017

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.46)

Genre:
- Research Report (0.82)

Industry:
- Transportation > Air (1.00)
- Information Technology > Security & Privacy (0.69)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found