Evasion and Hardening of Tree Ensemble Classifiers

Kantchelian, Alex, Tygar, J. D., Joseph, Anthony D.

May-26-2016–arXiv.org Machine Learning

Classifier evasion consists in finding for a given instance $x$ the nearest instance $x'$ such that the classifier predictions of $x$ and $x'$ are different. We present two novel algorithms for systematically computing evasions for tree ensembles such as boosted trees and random forests. Our first algorithm uses a Mixed Integer Linear Program solver and finds the optimal evading instance under an expressive set of constraints. Our second algorithm trades off optimality for speed by using symbolic prediction, a novel algorithm for fast finite differences on tree ensembles. On a digit recognition task, we demonstrate that both gradient boosted trees and random forests are extremely susceptible to evasions. Finally, we harden a boosted tree model without loss of predictive accuracy by augmenting the training set of each boosting round with evading instances, a technique we call adversarial boosting.

artificial intelligence, constraint, machine learning, (16 more...)

arXiv.org Machine Learning

May-26-2016

arXiv.org PDF

Add feedback

Country:
- North America > United States (1.00)

Genre:
- Research Report (0.64)

Industry:
- Information Technology > Security & Privacy (0.46)
- Government > Regional Government
  - North America Government > United States Government (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Ensemble Learning (1.00)
  - Decision Tree Learning (1.00)
  - Statistical Learning (0.94)
  - Performance Analysis > Accuracy (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found