Evasion and Hardening of Tree Ensemble Classifiers
Kantchelian, Alex, Tygar, J. D., Joseph, Anthony D.
Classifier evasion consists in finding for a given instance $x$ the nearest instance $x'$ such that the classifier predictions of $x$ and $x'$ are different. We present two novel algorithms for systematically computing evasions for tree ensembles such as boosted trees and random forests. Our first algorithm uses a Mixed Integer Linear Program solver and finds the optimal evading instance under an expressive set of constraints. Our second algorithm trades off optimality for speed by using symbolic prediction, a novel algorithm for fast finite differences on tree ensembles. On a digit recognition task, we demonstrate that both gradient boosted trees and random forests are extremely susceptible to evasions. Finally, we harden a boosted tree model without loss of predictive accuracy by augmenting the training set of each boosting round with evading instances, a technique we call adversarial boosting.
May-26-2016
- Country:
- Genre:
- Research Report (0.64)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence > Machine Learning
- Ensemble Learning (1.00)
- Decision Tree Learning (1.00)
- Statistical Learning (0.94)
- Performance Analysis > Accuracy (0.34)
- Information Technology > Artificial Intelligence > Machine Learning