Detection of Iterative Adversarial Attacks via Counter Attack
Rottmann, Matthias, Peyron, Mathis, Krejic, Natasa, Gottschalk, Hanno
Deep neural networks (DNNs) have proven to be powerful tools for processing unstructured data. However for high-dimensional data, like images, they are inherently vulnerable to adversarial attacks. Small almost invisible perturbations added to the input can be used to fool DNNs. Various attacks, hardening methods and detection methods have been introduced in recent years. Notoriously, Carlini-Wagner (CW) type attacks computed by iterative minimization belong to those that are most difficult to detect. In this work, we demonstrate that such iterative minimization attacks can by used as detectors themselves. Thus, in some sense we show that one can fight fire with fire. This work also outlines a mathematical proof that under certain assumptions this detector provides asymptotically optimal separation of original and attacked images. In numerical experiments, we obtain AUROC values up to 99.73% for our detection method. This distinctly surpasses state of the art detection rates for CW attacks from the literature. We also give numerical evidence that our method is robust against the attacker's choice of the method of attack.
Sep-23-2020
- Country:
- North America > United States
- New York (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Serbia > Vojvodina
- South Bačka District > Novi Sad (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Data Science > Data Mining (0.93)
- Artificial Intelligence > Machine Learning
- Performance Analysis > Accuracy (1.00)
- Neural Networks > Deep Learning (0.66)
- Information Technology