On Measuring Intrinsic Causal Attributions in Deep Neural Networks

Saha, Saptarshi, Rathore, Dhruv Vansraj, Saha, Soumadeep, Garain, Utpal, Doermann, David

May-16-2025–arXiv.org Machine Learning

Quantifying the causal influence of input features within neural networks has become a topic of increasing interest. Existing approaches typically assess direct, indirect, and total causal effects. This work treats neural networks (NNs) as structural causal models (SCMs) and extends the analysis to include intrinsic causal contributions (ICC). We propose an identifiable generative post-hoc framework for quantifying ICC. We also draw a relationship between ICC and Sobol' indices. Our experiments on synthetic and real-world datasets demonstrate that ICC generates more intuitive and reliable explanations than existing global explanation techniques.
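
The abstract names Sobol' indices without defining them. As background only, the first-order Sobol' index of input X_i for output Y is Var(E[Y | X_i]) / Var(Y), the share of output variance explained by X_i alone. The sketch below is not the paper's ICC framework; it is a minimal illustration that estimates first-order indices with the standard pick-freeze (Saltelli-style) Monte Carlo estimator, assuming independent standard-normal inputs. The names `first_order_sobol` and `toy_model` are hypothetical and chosen for this example.

```python
import random
from statistics import fmean


def first_order_sobol(f, d, n=100_000, seed=0):
    """Pick-freeze estimate of first-order Sobol' indices.

    Assumes d independent standard-normal inputs; `f` maps a list of
    d floats to a single float.
    """
    rng = random.Random(seed)
    # Two independent sample matrices A and B, each with n rows and d columns.
    a = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    b = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    fa = [f(row) for row in a]
    fb = [f(row) for row in b]

    # Total output variance, estimated from both sample sets.
    ys = fa + fb
    mean_y = fmean(ys)
    var_y = fmean((y - mean_y) ** 2 for y in ys)

    indices = []
    for i in range(d):
        # Evaluate f on A with coordinate i replaced by the value from B.
        fab = [f(ra[:i] + [rb[i]] + ra[i + 1:]) for ra, rb in zip(a, b)]
        # Estimator of Var(E[Y | X_i]): mean of f(B) * (f(AB_i) - f(A)).
        v_i = fmean(yb * (yab - ya) for yb, yab, ya in zip(fb, fab, fa))
        indices.append(v_i / var_y)
    return indices


def toy_model(x):
    # Hypothetical additive model: Y = X_1 + 2 * X_2.
    return x[0] + 2.0 * x[1]


sobol = first_order_sobol(toy_model, d=2)
```

For this toy model the indices can be checked by hand: Var(Y) = 1 + 4 = 5, so the exact first-order indices are 1/5 and 4/5, and the Monte Carlo estimates in `sobol` should land close to those values.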

contribution, machine learning, natural language

Country:
- Oceania > Palau (0.04)
- North America > United States
  - New York > Erie County > Buffalo (0.04)
- Europe
  - Switzerland (0.04)
  - Spain (0.04)
  - Russia (0.04)
  - Austria (0.04)
  - United Kingdom > England
    - Tyne and Wear > Newcastle (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
- Asia
  - Russia (0.04)
  - India > West Bengal
    - Kolkata (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.82)
