Stabilizing Estimates of Shapley Values with Control Variates

Nov-9-2023–arXiv.org Machine Learning

Shapley values are among the most popular tools for explaining predictions of blackbox machine learning models. However, their high computational cost motivates the use of sampling approximations, inducing a considerable degree of uncertainty. To stabilize these model explanations, we propose ControlSHAP, an approach based on the Monte Carlo technique of control variates. Our methodology is applicable to any machine learning model and requires virtually no extra computation or modeling effort. On several ...

In layman's terms, Shapley values quantify how much information is gained from being told the value of each feature. Shapley values are rarely computed exactly, as the computational cost is exponential in the number of input features. Rather, they are typically estimated using the Shapley Sampling or KernelSHAP algorithm (Lundberg and Lee, 2017; Strumbelj and Kononenko, 2010, 2014). These algorithms, however, are subject to sampling variability; as a result, running the same procedure twice may yield different estimated Shapley values, including different estimated orderings of features. This instability raises questions about the trustworthiness of insights gleaned from Shapley values.
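To make the sampling variability and the control-variate idea concrete, here is a minimal sketch on a toy problem. It is not the ControlSHAP procedure itself. The model `f`, the approximation `g`, the instance `X`, the all-zeros baseline, and every function name are assumptions chosen for illustration. The sketch estimates one Shapley value by sampling feature permutations, then corrects the estimate using a simpler function `g` whose Shapley value can be computed exactly.

```python
import itertools
import math
import random
import statistics

X = (1.0, 2.0, 0.5)  # hypothetical instance to explain; the baseline is all zeros
D = len(X)

def f(z):
    """Toy stand-in for a black-box model (an assumption, not from the paper)."""
    return math.sin(z[0]) + z[0] * z[1] + z[0] * z[1] * z[2]

def g(z):
    """Crude quadratic approximation of f, used as the control variate."""
    return z[0] + z[0] * z[1]

def masked(subset):
    """Keep the features in `subset`; set all others to the baseline value 0."""
    return tuple(X[i] if i in subset else 0.0 for i in range(D))

def marginal(model, perm, j):
    """Change in model output when feature j joins its predecessors in perm."""
    before = set(perm[:perm.index(j)])
    return model(masked(before | {j})) - model(masked(before))

def exact_shapley(model, j):
    """Average over all D! permutations; only feasible because D is tiny."""
    perms = list(itertools.permutations(range(D)))
    return sum(marginal(model, p, j) for p in perms) / len(perms)

def estimate(j, n_perms, rng, use_control_variate):
    """Permutation-sampling estimate of feature j's Shapley value for f."""
    perms = [rng.sample(range(D), D) for _ in range(n_perms)]
    mf = [marginal(f, p, j) for p in perms]
    plain = statistics.fmean(mf)
    if not use_control_variate:
        return plain
    # Evaluate g on the same permutations so its sampling error tracks f's.
    mg = [marginal(g, p, j) for p in perms]
    mean_g = statistics.fmean(mg)
    ss_g = sum((b - mean_g) ** 2 for b in mg)
    if ss_g == 0.0:
        return plain  # g did not vary in this sample; nothing to correct
    # Coefficient = sample covariance / sample variance (the n-1 factors cancel).
    c = sum((a - plain) * (b - mean_g) for a, b in zip(mf, mg)) / ss_g
    # Subtract g's known sampling error, scaled by c.
    return plain - c * (mean_g - exact_shapley(g, j))

def spread(use_control_variate, j=0, runs=200, n_perms=50, seed=0):
    """Standard deviation of the estimate across repeated independent runs."""
    rng = random.Random(seed)
    ests = [estimate(j, n_perms, rng, use_control_variate) for _ in range(runs)]
    return statistics.pstdev(ests)
```

Comparing `spread(False)` with `spread(True)` shows how much the repeated-run variability shrinks on this toy problem. Here the exact Shapley value of `g` comes from brute-force enumeration, which only works for a handful of features. A practical control variate would need a closed-form Shapley value.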

artificial intelligence, data mining, machine learning

Country:
- North America > United States > California > San Francisco County > San Francisco (0.14)

Genre:
- Research Report (0.85)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Neural Networks (0.69)
  - Data Science > Data Mining (1.00)
  - Game Theory (1.00)
