Very fast, approximate counterfactual explanations for decision forests
Carreira-Perpiñán, Miguel Á., Hada, Suryabhan Singh
arXiv.org Artificial Intelligence
We consider finding a counterfactual explanation for a classification or regression forest, such as a random forest. This requires solving an optimization problem to find the closest input instance to a given instance for which the forest outputs a desired value. Finding an exact solution has a cost that is exponential in the number of leaves in the forest. We propose a simple but very effective approach: we constrain the optimization to only those input space regions defined by the forest that are populated by actual data points. The problem reduces to a form of nearest-neighbor search using a certain distance on a certain dataset. This has two advantages: first, the solution can be found very quickly, scaling to large forests and high-dimensional data, and enabling interactive use. Second, the solution found is more likely to be realistic in that it is guided towards high-density areas of input space.
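The reduction to nearest-neighbor search can be illustrated with a deliberately simplified sketch. It is not the authors' method: the abstract says the search uses "a certain distance on a certain dataset" without specifying either, so this sketch assumes plain Euclidean distance to the data points themselves, rather than to the forest regions that contain them. The toy forest (`toy_forest_predict`, three voting stumps), the sample data, and the function name `nearest_data_counterfactual` are all hypothetical and introduced only for illustration.

```python
import math
from typing import Callable, Optional, Sequence, Tuple

Point = Tuple[float, ...]


def nearest_data_counterfactual(
    x: Point,
    data: Sequence[Point],
    predict: Callable[[Point], int],
    target: int,
) -> Optional[Point]:
    """Return the data point closest to x that the forest maps to `target`.

    Simplifying assumption: Euclidean distance to the data points themselves.
    Returns None if no data point receives the target prediction.
    """
    best: Optional[Point] = None
    best_dist = math.inf
    for z in data:
        if predict(z) != target:
            continue  # only data points with the desired output are candidates
        d = math.dist(x, z)
        if d < best_dist:
            best, best_dist = z, d
    return best


def toy_forest_predict(z: Point) -> int:
    """Hypothetical forest: majority vote of three axis-aligned stumps."""
    votes = [z[0] > 0.5, z[1] > 0.5, z[0] > 0.7]
    return int(sum(votes) >= 2)


# Hypothetical "training" data that populates some of the forest's regions.
data = [(0.2, 0.1), (0.6, 0.9), (0.9, 0.2), (0.55, 0.6), (0.4, 0.8)]

x = (0.4, 0.4)  # source instance, predicted as class 0 by the toy forest
counterfactual = nearest_data_counterfactual(x, data, toy_forest_predict, target=1)
print(counterfactual)
```

Because the candidates are restricted to observed data, the cost of this sketch is one linear scan over the dataset rather than a search over combinations of leaves, and the returned point lies where data actually exists.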
Mar-5-2023
- Country:
- North America
- Canada (0.93)
- United States > California (0.28)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine > Therapeutic Area > Oncology (0.47)
- Technology: