Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

Zhang, Xin, Solar-Lezama, Armando, Singh, Rishabh

Dec-31-2018–Neural Information Processing Systems

We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfaction problems. The technique is evaluated on three neural network models: one predicting whether an applicant will pay a mortgage, one predicting whether a first-order theorem can be proved efficiently by a solver using certain heuristics, and the final one judging whether a drawing is an accurate rendition of a canonical drawing of a cat.

correction, deep learning, neural network, (19 more...)

Neural Information Processing Systems

Dec-31-2018

Conferences PDF

Add feedback

Country:
- Europe > Switzerland
  - Zürich > Zürich (0.14)
- North America > United States
  - California > San Francisco County > San Francisco (0.14)

Industry:
- Banking & Finance > Loans (0.70)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)
    - Neural Networks > Deep Learning (0.68)
  - Representation & Reasoning (1.00)

Duplicate Docs Excel Report

Title
Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections
Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

Similar Docs Excel Report more

Title	Similarity	Source
None found