A Formal Approach to Explainability
Wolf, Lior, Galanti, Tomer, Hazan, Tamir
We regard explanations as a blending of the input sample and the model's output and offer a few definitions that capture various desired properties of the function that generates t hese explanations. We study the links between these properties a nd between explanation-generating functions and intermedia te representations of learned models and are able to show, for example, that if the activations of a given layer are consist ent with an explanation, then so do all other subsequent layers. In addition, we study the intersection and union of explanatio ns as a way to construct new explanations.
Jan-15-2020
- Country:
- North America > United States (0.04)
- Asia > Middle East
- Israel > Tel Aviv District > Tel Aviv (0.04)
- Genre:
- Research Report (0.50)
- Technology: