Very fast, approximate counterfactual explanations for decision forests

Carreira-Perpiñán, Miguel Á., Hada, Suryabhan Singh

Mar-5-2023–arXiv.org Artificial Intelligence

We consider finding a counterfactual explanation for a classification or regression forest, such as a random forest. This requires solving an optimization problem to find the closest input instance to a given instance for which the forest outputs a desired value. Finding an exact solution has a cost that is exponential on the number of leaves in the forest. We propose a simple but very effective approach: we constrain the optimization to only those input space regions defined by the forest that are populated by actual data points. The problem reduces to a form of nearest-neighbor search using a certain distance on a certain dataset. This has two advantages: first, the solution can be found very quickly, scaling to large forests and high-dimensional data, and enabling interactive use. Second, the solution found is more likely to be realistic in that it is guided towards high-density areas of input space.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Mar-5-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - District of Columbia > Washington (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
    - California
      - San Diego County > San Diego (0.04)
      - Merced County > Merced (0.04)
  - Canada
    - Ontario > Toronto (0.04)
    - Nova Scotia > Halifax Regional Municipality
      - Halifax (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
    - Alberta > Census Division No. 15
      - Improvement District No. 9 > Banff (0.04)
- Europe
  - Italy (0.04)
  - Spain > Basque Country
    - Biscay Province > Bilbao (0.04)
- Asia > India
  - Telangana > Hyderabad (0.04)

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine > Therapeutic Area > Oncology (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning > Decision Tree Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found