Inspecting adversarial examples using the Fisher information

Sep-12-2019 – arXiv.org Artificial Intelligence

Adversarial examples are slightly perturbed inputs designed to fool artificial neural networks. This work studies the usability of the Fisher information for detecting such adversarial attacks. We discuss various quantities whose computation scales well with the network size, study their behavior on adversarial examples, and show how they can highlight the importance of single input neurons, thereby providing a visual tool for further analyzing (un-)reasonable behavior of a neural network. The potential of our methods is demonstrated by applications to the MNIST, CIFAR10 and Fruits-360 datasets.
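
The abstract does not say which Fisher-based quantities the authors compute, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a toy linear softmax classifier and computes the Fisher information matrix of the predicted class distribution with respect to the input, plus its trace as one possible scalar summary. The model, the function name `input_fisher`, and the random weights are all hypothetical and chosen for illustration.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D logit vector.
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def input_fisher(W, b, x):
    """Fisher information of p(y | x) with respect to the input x.

    Assumes a toy model p(y | x) = softmax(W x + b); this is an
    illustrative choice, not the network used in the paper.
    """
    p = softmax(W @ x + b)
    # Row k holds the gradient of log p_k with respect to x, i.e. (e_k - p)^T W.
    grads = (np.eye(len(p)) - p) @ W
    # F = sum_k p_k * grad_k grad_k^T, the expected outer product of score vectors.
    F = grads.T @ (p[:, None] * grads)
    return F, p

# Hypothetical example: 3 classes, 5 input features, random parameters.
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 5))
b = rng.normal(size=3)
x = rng.normal(size=5)

F, p = input_fisher(W, b, x)
fisher_trace = np.trace(F)        # scalar summary of input sensitivity
per_feature = np.diag(F)          # one non-negative score per input feature
```

In this sketch the diagonal of `F` gives one score per input feature, which is the kind of per-input quantity that could be rendered as a heat map; whether this matches the authors' visual tool is an assumption.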

artificial intelligence, deep learning, machine learning

Genre:
- Research Report (0.64)

Industry:
- Information Technology > Security & Privacy (0.38)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Performance Analysis > Accuracy (0.94)
  - Neural Networks > Deep Learning (0.69)
