Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections
Zhang, Xin, Solar-Lezama, Armando, Singh, Rishabh
–Neural Information Processing Systems
We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfaction problems. The technique is evaluated on three neural network models: one predicting whether an applicant will pay a mortgage, one predicting whether a first-order theorem can be proved efficiently by a solver using certain heuristics, and the final one judging whether a drawing is an accurate rendition of a canonical drawing of a cat.
Neural Information Processing Systems
Dec-31-2018
- Country:
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Spain > Catalonia
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- Arizona > Maricopa County
- Scottsdale (0.04)
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- Nebraska (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.04)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Texas > Travis County
- Austin (0.04)
- Arizona > Maricopa County
- Canada > Quebec
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Europe
- Industry:
- Banking & Finance > Loans (0.70)