The (Un)reliability of saliency methods

Kindermans, Pieter-Jan, Hooker, Sara, Adebayo, Julius, Alber, Maximilian, Schütt, Kristof T., Dähne, Sven, Erhan, Dumitru, Kim, Been

Nov-2-2017–arXiv.org Machine Learning

Saliency methods aim to explain the predictions of deep neural networks. These methods lack reliability when the explanation is sensitive to factors that do not contribute to the model prediction. We use a simple and common pre-processing step --adding a constant shift to the input data-- to show that a transformation with no effect on the model can cause numerous methods to incorrectly attribute. In order to guarantee reliability, we posit that methods should fulfill input invariance, the requirement that a saliency method mirror the sensitivity of the model with respect to transformations of the input. We show, through several examples, that saliency methods that do not satisfy input invariance result in misleading attribution. Research to address this tension is urgently needed; reliable explanations build trust with users, help identify points of model failure and remove barriers to entry for the deployment of deep neural networks in domains like health care, security and transportation. In deep neural networks, data representation is delegated to the model and subsequently we cannot generally say in an informative way what led to a model prediction.

artificial intelligence, input invariance, machine learning, (18 more...)

arXiv.org Machine Learning

Nov-2-2017

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found