Generative causal explanations of black-box classifiers

Oct-2-2025, 17:09:03 GMT–Neural Information Processing Systems

We develop a method for generating causal post-hoc explanations of black-box classifiers based on a learned low-dimensional representation of the data. The explanation is causal in the sense that changing learned latent factors produces a change in the classifier output statistics. To construct these explanations, we design a learning framework that leverages a generative model and information-theoretic measures of causal influence.

explanation, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Oct-2-2025, 17:09:03 GMT

Conferences PDF

Add feedback

Country:
- Europe (1.00)
- North America
  - Canada (0.95)
  - United States > California (0.47)

Industry:
- Information Technology > Security & Privacy (1.00)
- Transportation > Air (0.62)

Technology:
- Information Technology
  - Security & Privacy (0.93)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (0.89)
    - Machine Learning
      - Statistical Learning (0.68)
      - Neural Networks > Deep Learning (0.46)

Duplicate Docs Excel Report

Title
3a93a609b97ec0ab0ff5539eb79ef33a-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found