Detecting Backdoors in Neural Networks Using Novel Feature-Based Anomaly Detection
Fu, Hao, Veldanda, Akshaj Kumar, Krishnamurthy, Prashanth, Garg, Siddharth, Khorrami, Farshad
–arXiv.org Artificial Intelligence
This paper proposes a new defense against neural network backdooring attacks that are maliciously trained to mispredict in the presence of attacker-chosen triggers. Our defense is based on the intuition that the feature extraction layers of a backdoored network embed new features to detect the presence of a trigger and the subsequent classification layers learn to mispredict when triggers are detected. Therefore, to detect backdoors, the proposed defense uses two synergistic anomaly detectors trained on clean validation data: the first is a novelty detector that checks for anomalous features, while the second detects anomalous mappings from features to outputs by comparing with a separate classifier trained on validation data. The approach is evaluated on a wide range of backdoored networks (with multiple variations of triggers) that successfully evade state-of-the-art defenses. Additionally, we evaluate the robustness of our approach on imperceptible perturbations, scalability on large-scale datasets, and effectiveness under domain shift. This paper also shows that the defense can be further improved using data augmentation.
arXiv.org Artificial Intelligence
Nov-4-2020
- Country:
- Asia
- Europe
- France > Auvergne-Rhône-Alpes
- Greece (0.04)
- Spain > Galicia
- Madrid (0.04)
- Sweden (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- California
- Los Angeles County > Los Angeles (0.14)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- Colorado > El Paso County
- Colorado Springs (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York > Kings County
- New York City (0.04)
- Maryland > Baltimore (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Texas > Dallas County
- Dallas (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- California
- Canada
- South America > Chile
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: