Defense Through Diverse Directions

Bender, Christopher M., Li, Yang, Shi, Yifeng, Reiter, Michael K., Oliva, Junier B.

Mar-23-2020, arXiv.org Machine Learning

In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of the network weights that results from minimizing the divergence between the learned parameter distribution and a prior. In addition, we require that the model maintain some expected uncertainty with respect to all input covariates. We demonstrate that by encouraging the network to spread its reliance evenly across inputs, the network becomes less susceptible to localized, brittle features, which imparts a natural robustness to targeted perturbations. We show empirical robustness on several benchmark datasets.
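The two ingredients named in the abstract, a divergence penalty between the learned weight distribution and a prior, and a term that rewards spreading reliance across all inputs, can be illustrated with a small sketch. This is not the authors' objective: the abstract does not give the exact form of the regularizer. The sketch assumes a diagonal Gaussian weight posterior with a standard normal prior, and it uses a hypothetical per-input `saliency` vector (for example, gradient magnitudes) whose normalized entropy stands in for "evenness across inputs". The names `kl_to_standard_normal`, `input_spread_entropy`, `regularized_loss`, `beta`, and `lam` are all illustrative.

```python
import math


def kl_to_standard_normal(mu, sigma):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ), summed over all weights."""
    return sum(0.5 * (s * s + m * m - 1.0) - math.log(s)
               for m, s in zip(mu, sigma))


def input_spread_entropy(saliency):
    """Shannon entropy of the normalized |saliency| vector.

    Assumption: `saliency` is a hypothetical per-input influence score.
    The entropy is log(d) when influence is spread evenly over d inputs
    and 0 when it is concentrated on a single input.
    """
    mags = [abs(v) for v in saliency]
    total = sum(mags)
    if total == 0.0:
        return 0.0
    probs = [m / total for m in mags]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def regularized_loss(nll, mu, sigma, saliency, beta=1.0, lam=0.1):
    """Data term + weight-divergence penalty - reward for even input reliance.

    `beta` and `lam` are illustrative trade-off weights, not values from
    the paper.
    """
    return (nll
            + beta * kl_to_standard_normal(mu, sigma)
            - lam * input_spread_entropy(saliency))


# Same data fit and same weight posterior, different input reliance.
mu, sigma = [0.0, 0.0], [1.0, 1.0]
even = regularized_loss(1.0, mu, sigma, [0.5, 0.5, 0.5, 0.5])
peaked = regularized_loss(1.0, mu, sigma, [2.0, 0.0, 0.0, 0.0])
print(even < peaked)  # True
```

Under these assumptions, a model that leans on a single input is penalized relative to one that spreads its reliance, which mirrors the intuition in the abstract that localized, brittle features are what targeted perturbations exploit.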



