Generative causal explanations of black-box classifiers
Neural Information Processing Systems
We develop a method for generating causal post-hoc explanations of black-box classifiers based on a learned low-dimensional representation of the data. The explanation is causal in the sense that changing learned latent factors produces a change in the classifier output statistics. To construct these explanations, we design a learning framework that leverages a generative model and information-theoretic measures of causal influence.
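To make the idea of an information-theoretic measure of causal influence concrete, the following is a minimal sketch and not the authors' implementation. It assumes, purely for illustration, that the influence of a latent factor on the classifier output Y is measured by their mutual information, estimated by Monte Carlo sampling. The names `toy_decoder`, `toy_classifier`, and `mutual_information`, the two latent factors `alpha` and `beta`, and the standard normal priors are all hypothetical.

```python
import numpy as np

def toy_decoder(alpha, beta):
    # Hypothetical generative model: maps two latent factors to a 2-D data point.
    return np.stack([alpha, beta], axis=-1)

def toy_classifier(x):
    # Hypothetical black box: class probabilities depend only on the first feature.
    p1 = 1.0 / (1.0 + np.exp(-4.0 * x[..., 0]))
    return np.stack([1.0 - p1, p1], axis=-1)

def mutual_information(factor, n_outer=500, n_inner=500, seed=0):
    """Monte Carlo estimate (in nats) of I(factor; Y) under standard normal priors."""
    rng = np.random.default_rng(seed)
    z_fixed = rng.standard_normal(n_outer)             # sampled values of the factor of interest
    z_other = rng.standard_normal((n_outer, n_inner))  # the other factor, to be averaged out
    z_fixed_b = np.broadcast_to(z_fixed[:, None], z_other.shape)
    if factor == "alpha":
        x = toy_decoder(z_fixed_b, z_other)
    else:
        x = toy_decoder(z_other, z_fixed_b)
    # p(y | factor): average the classifier output over the other factor.
    p_y_given_z = toy_classifier(x).mean(axis=1)
    # p(y): average over the factor of interest as well.
    p_y = p_y_given_z.mean(axis=0)
    eps = 1e-12  # guards against log(0)
    kl = np.sum(p_y_given_z * (np.log(p_y_given_z + eps) - np.log(p_y + eps)), axis=1)
    return float(kl.mean())

mi_alpha = mutual_information("alpha")
mi_beta = mutual_information("beta")
print(f"I(alpha; Y) = {mi_alpha:.3f} nats, I(beta; Y) = {mi_beta:.3f} nats")
```

In this toy setup only `alpha` reaches the feature the classifier uses, so its estimated influence should be clearly larger than that of `beta`, whose estimate should sit near zero up to sampling noise.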
Oct-2-2025, 17:09:03 GMT