Evaluating and Understanding the Robustness of Adversarial Logit Pairing

Engstrom, Logan, Ilyas, Andrew, Athalye, Anish

Nov-23-2018–arXiv.org Machine Learning

We evaluate the robustness of Adversarial Logit Pairing, a recently proposed defense againstadversarial examples. We find that a network trained with Adversarial Logit Pairing achieves 0.6% correct classification rate under targeted adversarial attack, the threat model in which the defense is considered. We provide a brief overview of the defense and the threat models/claims considered, as well as a discussion of the methodology and results of our attack. Our results offer insights into the reasons underlying the vulnerability of ALP to adversarial attack, and are of general interest in evaluating and understanding adversarial defenses.

adversarial example, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

Nov-23-2018

arXiv.org PDF

Add feedback

Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre:
- Research Report (0.72)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found