Classifier-independent Lower-Bounds for Adversarial Robustness
We theoretically analyse the limits of robustness to test-time adversarial and noisy examples in classification. Our work focuses on deriving bounds that apply uniformly to all classifiers (i.e., all measurable functions from features to labels) for a given problem. Our contributions are twofold. (1) We use optimal transport theory to derive variational formulae for the Bayes-optimal error a classifier can make on a given classification problem, subject to adversarial attacks. The optimal adversarial attack is then an optimal transport plan for a certain binary cost function induced by the specific attack model, and can be computed via a simple algorithm based on maximal matching on bipartite graphs. (2) We derive explicit lower bounds on the Bayes-optimal error in the case of the popular distance-based attacks. These bounds are universal in the sense that they depend on the geometry of the class-conditional distributions of the data, but not on any particular classifier. Our results are in sharp contrast with the existing literature, wherein adversarial vulnerability of classifiers is derived as a consequence of nonzero ordinary test error.
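To make the matching-based idea concrete, here is a minimal illustrative sketch, not the paper's exact algorithm: given finite samples from the two class-conditional distributions and an attack budget `eps`, we join two opposite-class points when a distance-bounded attack could move both onto a common point (here, Euclidean distance at most `2*eps`), and then a maximum matching on this bipartite graph yields an empirical lower bound on adversarial error, since any classifier must err on at least one point of each matched pair. All function names and the `2*eps` confusability criterion are assumptions for illustration.

```python
def max_matching(adj, n_left):
    """Augmenting-path maximum bipartite matching.
    adj[u] = list of right-side vertices adjacent to left vertex u."""
    match_right = {}  # right vertex -> matched left vertex

    def try_augment(u, seen):
        for v in adj[u]:
            if v in seen:
                continue
            seen.add(v)
            # v is free, or its current partner can be re-matched elsewhere
            if v not in match_right or try_augment(match_right[v], seen):
                match_right[v] = u
                return True
        return False

    return sum(try_augment(u, set()) for u in range(n_left))


def adversarial_error_lower_bound(class0, class1, eps):
    """Empirical lower bound on adversarial error for attack radius eps.
    class0, class1: lists of feature vectors (tuples of floats)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    # edge iff an eps-attack on each point can produce a collision
    adj = [[j for j, q in enumerate(class1) if dist(p, q) <= 2 * eps]
           for p in class0]
    m = max_matching(adj, len(class0))
    n = len(class0) + len(class1)
    # each matched pair forces at least one error on any classifier
    return m / n


# usage: the two nearby points (within 2*eps = 0.6) form one matched pair
c0 = [(0.0, 0.0), (1.0, 0.0)]
c1 = [(0.5, 0.0), (5.0, 5.0)]
print(adversarial_error_lower_bound(c0, c1, 0.3))  # -> 0.25
```

The bound is classifier-independent by construction: it is computed from the sample geometry alone, mirroring the paper's point that the limit depends only on the class-conditional distributions.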
Nov-9-2020