SIM-Shapley: A Stable and Computationally Efficient Approach to Shapley Value Approximation

Fan, Wangxuan, Li, Siqi, Zhou, Doudou, Okada, Yohei, Hong, Chuan, Liu, Molei, Liu, Nan

May-14-2025–arXiv.org Machine Learning

Explainable artificial intelligence (XAI) is essential for trustworthy machine learning (ML), particularly in high-stakes domains such as healthcare and finance. Shapley value (SV) methods provide a principled framework for feature attribution in complex models but incur high computational costs, limiting their scalability in high-dimensional settings. We propose Stochastic Iterative Momentum for Shapley Value Approximation (SIM-Shapley), a stable and efficient SV approximation method inspired by stochastic optimization. We analyze variance theoretically, prove linear $Q$-convergence, and demonstrate improved empirical stability and low bias in practice on real-world datasets. In our numerical experiments, SIM-Shapley reduces computation time by up to 85% relative to state-of-the-art baselines while maintaining comparable feature attribution quality. Beyond feature attribution, our stochastic mini-batch iterative framework extends naturally to a broader class of sample average approximation problems, offering a new avenue for improving computational efficiency with stability guarantees. Code is publicly available at https://github.com/nliulab/SIM-Shapley.

artificial intelligence, machine learning, optimization problem, (15 more...)

arXiv.org Machine Learning

May-14-2025

arXiv.org PDF

Add feedback

Country:
- North America
  - Canada > Ontario (0.04)
  - United States
    - New York (0.04)
    - California
      - San Bernardino County > Ontario (0.04)
      - Los Angeles County > Santa Monica (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)
- Asia
  - Singapore (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.88)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found