Explanation-Based Training with Differentiable Insertion/Deletion Metric-Aware Regularizers

Oct-19-2023–arXiv.org Machine Learning

The quality of explanations for the predictions of complex machine learning predictors is often measured using insertion and deletion metrics, which assess the faithfulness of the explanations, i.e., how correctly the explanations reflect the predictor's behavior. To improve the faithfulness, we propose insertion/deletion metric-aware explanation-based optimization (ID-ExpO), which optimizes differentiable predictors to improve both insertion and deletion scores of the explanations while keeping their predictive accuracy. Since the original insertion and deletion metrics are indifferentiable with respect to the explanations and directly unavailable for gradient-based optimization, we extend the metrics to be differentiable and use them to formalize insertion and deletion metric-based regularizers. The experimental results on image and tabular datasets show that the deep neural networks-based predictors fine-tuned using ID-ExpO enable popular post-hoc explainers to produce more faithful and easy-to-interpret explanations while keeping high predictive accuracy.

artificial intelligence, explanation, machine learning, (14 more...)

arXiv.org Machine Learning

Oct-19-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Florida > Broward County > Fort Lauderdale (0.04)
- Europe > Italy
  - Marche > Ancona Province > Ancona (0.04)

Genre:
- Research Report (0.82)

Industry:
- Information Technology (0.93)
- Transportation (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Neural Networks > Deep Learning (0.88)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found