Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections
Xin Zhang, Armando Solar-Lezama, Rishabh Singh
We describe a new algorithm that generates minimal, stable, and symbolic corrections to an input, each of which causes a neural network with ReLU neurons to change its output. We argue that such a correction is a useful way to give feedback to a user when the network's output differs from the desired one. The algorithm generates a correction by solving a series of linear constraint satisfaction problems. We evaluate the technique on a neural network trained to predict whether an applicant will pay a mortgage.
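As an illustration of why linear constraints are a natural fit for ReLU networks, the sketch below shows that once the on/off pattern of the ReLU units is fixed, the network output is an affine function of the input. This is a minimal sketch on a hypothetical toy network, not the paper's algorithm: the weights `W1`, `b1`, `w2`, `b2` and the helper names `activation_pattern` and `affine_for_pattern` are assumptions made up for this example.

```python
# Hypothetical toy network: 2 inputs -> 2 ReLU hidden units -> 1 output.
# All weights are invented for illustration only.
W1 = [[1.0, -1.0], [0.5, 2.0]]  # one row of input weights per hidden unit
b1 = [0.0, -1.0]                # hidden biases
w2 = [1.0, -2.0]                # output weights
b2 = 0.5                        # output bias


def hidden_pre(x):
    """Pre-activation value of each hidden unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(W1, b1)]


def forward(x):
    """Ordinary forward pass with ReLU activations."""
    h = [max(0.0, z) for z in hidden_pre(x)]
    return sum(w * hi for w, hi in zip(w2, h)) + b2


def activation_pattern(x):
    """Which hidden units are active (pre-activation > 0) at input x."""
    return [z > 0 for z in hidden_pre(x)]


def affine_for_pattern(pattern):
    """Return (a, c) such that forward(x) == a . x + c for every x
    whose activation pattern equals `pattern`."""
    n_in = len(W1[0])
    active = [j for j, on in enumerate(pattern) if on]
    a = [sum(w2[j] * W1[j][i] for j in active) for i in range(n_in)]
    c = b2 + sum(w2[j] * b1[j] for j in active)
    return a, c


x = [2.0, 1.0]
a, c = affine_for_pattern(activation_pattern(x))
affine_value = sum(ai * xi for ai, xi in zip(a, x)) + c
```

Within one activation pattern, both the output and the conditions that keep each unit on or off are linear in the input, so asking for an input that flips the output while staying in that region reduces to a set of linear constraints.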
Feb-20-2018
- Country:
- Europe > Switzerland
- North America
- Canada > Quebec (0.14)
- United States
- California > San Francisco County
- San Francisco (0.14)
- Massachusetts (0.14)
- New York (0.14)
- Texas (0.14)
- Genre:
- Research Report (0.50)
- Industry:
- Banking & Finance
- Loans > Mortgages (0.46)
- Real Estate (0.68)