Generative causal explanations of black-box classifiers
–Neural Information Processing Systems
We develop a method for generating causal post-hoc explanations of black-box classifiers based on a learned low-dimensional representation of the data. The explanation is causal in the sense that changing learned latent factors produces a change in the classifier output statistics. To construct these explanations, we design a learning framework that leverages a generative model and information-theoretic measures of causal influence.
Neural Information Processing Systems
Nov-13-2025, 20:46:44 GMT
- Country:
- Asia
- Japan
- Honshū > Kantō
- Ibaraki Prefecture > Tsukuba (0.04)
- Kyūshū & Okinawa > Okinawa (0.04)
- Honshū > Kantō
- South Korea > Seoul
- Seoul (0.04)
- Japan
- Europe
- Denmark > Capital Region
- Copenhagen (0.04)
- France (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Denmark > Capital Region
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- British Columbia > Vancouver (0.04)
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- Quebec > Montreal (0.05)
- Alberta > Census Division No. 15
- United States
- California
- Los Angeles County > Long Beach (0.05)
- San Francisco County > San Francisco (0.14)
- Colorado > Denver County
- Denver (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- California
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia
- Industry:
- Information Technology > Security & Privacy (1.00)
- Law (0.68)
- Transportation > Air (0.62)
- Technology: