Very fast, approximate counterfactual explanations for decision forests
Carreira-Perpiñán, Miguel Á., Hada, Suryabhan Singh
–arXiv.org Artificial Intelligence
We consider finding a counterfactual explanation for a classification or regression forest, such as a random forest. This requires solving an optimization problem to find the closest input instance to a given instance for which the forest outputs a desired value. Finding an exact solution has a cost that is exponential on the number of leaves in the forest. We propose a simple but very effective approach: we constrain the optimization to only those input space regions defined by the forest that are populated by actual data points. The problem reduces to a form of nearest-neighbor search using a certain distance on a certain dataset. This has two advantages: first, the solution can be found very quickly, scaling to large forests and high-dimensional data, and enabling interactive use. Second, the solution found is more likely to be realistic in that it is guided towards high-density areas of input space.
arXiv.org Artificial Intelligence
Mar-5-2023
- Country:
- Asia > India
- Europe
- Italy (0.04)
- Spain > Basque Country
- Biscay Province > Bilbao (0.04)
- North America
- Canada
- United States
- California
- Merced County > Merced (0.04)
- San Diego County > San Diego (0.04)
- District of Columbia > Washington (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Washington > King County
- Seattle (0.04)
- California
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine > Therapeutic Area > Oncology (0.47)
- Technology: