Detecting Backdoors in Neural Networks Using Novel Feature-Based Anomaly Detection

Fu, Hao, Veldanda, Akshaj Kumar, Krishnamurthy, Prashanth, Garg, Siddharth, Khorrami, Farshad

Nov-4-2020–arXiv.org Artificial Intelligence

This paper proposes a new defense against neural network backdooring attacks that are maliciously trained to mispredict in the presence of attacker-chosen triggers. Our defense is based on the intuition that the feature extraction layers of a backdoored network embed new features to detect the presence of a trigger and the subsequent classification layers learn to mispredict when triggers are detected. Therefore, to detect backdoors, the proposed defense uses two synergistic anomaly detectors trained on clean validation data: the first is a novelty detector that checks for anomalous features, while the second detects anomalous mappings from features to outputs by comparing with a separate classifier trained on validation data. The approach is evaluated on a wide range of backdoored networks (with multiple variations of triggers) that successfully evade state-of-the-art defenses. Additionally, we evaluate the robustness of our approach on imperceptible perturbations, scalability on large-scale datasets, and effectiveness under domain shift. This paper also shows that the defense can be further improved using data augmentation.

artificial intelligence, data mining, machine learning, (14 more...)

arXiv.org Artificial Intelligence

Nov-4-2020

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America
  - United States
    - Maryland > Baltimore (0.04)
    - Nevada > Clark County
      - Las Vegas (0.04)
    - Texas > Dallas County
      - Dallas (0.04)
    - Ohio > Franklin County
      - Columbus (0.04)
    - New York > Kings County
      - New York City (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
    - Colorado > El Paso County
      - Colorado Springs (0.04)
    - California
      - San Francisco County > San Francisco (0.14)
      - Los Angeles County > Los Angeles (0.14)
      - Santa Clara County > San Jose (0.04)
      - San Diego County > San Diego (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
    - Alberta > Census Division No. 15
      - Improvement District No. 9 > Banff (0.04)
- Europe
  - Sweden (0.04)
  - Greece (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - Spain > Galicia
    - Madrid (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)
- Asia
  - Macao (0.04)
  - China (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Anomaly Detection (1.00)
  - Artificial Intelligence > Machine Learning
    - Performance Analysis > Accuracy (0.72)
    - Neural Networks > Deep Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found