Interpretable Regional Descriptors: Hyperbox-Based Local Explanations

Dandl, Susanne, Casalicchio, Giuseppe, Bischl, Bernd, Bothmann, Ludwig

May-4-2023–arXiv.org Machine Learning

This work introduces interpretable regional descriptors, or IRDs, for local, model-agnostic interpretations. IRDs are hyperboxes that describe how an observation's feature values can be changed without affecting its prediction. They justify a prediction by providing a set of "even if" arguments (semi-factual explanations), and they indicate which features affect a prediction and whether pointwise biases or implausibilities exist. A concrete use case shows that this is valuable for both machine learning modelers and persons subject to a decision. We formalize the search for IRDs as an optimization problem and introduce a unifying framework for computing IRDs that covers desiderata, initialization techniques, and a post-processing method. We show how existing hyperbox methods can be adapted to fit into this unified framework. A benchmark study compares the methods based on several quality measures and identifies two strategies to improve IRDs.

artificial intelligence, machine learning, optimization problem, (13 more...)

arXiv.org Machine Learning

May-4-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Arizona > Maricopa County > Phoenix (0.04)
- Europe
  - Switzerland (0.04)
  - United Kingdom > Scotland
    - City of Glasgow > Glasgow (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)

Genre:
- Research Report > Experimental Study (0.46)

Industry:
- Banking & Finance (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Optimization (0.87)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found