taylor decomposition
Elucidating Discrepancy in Explanations of Predictive Models Developed using EMR
Brankovic, Aida, Huang, Wenjie, Cook, David, Khanna, Sankalp, Bialkowski, Konstanty
The lack of transparency and explainability hinders the clinical adoption of Machine learning (ML) algorithms. While explainable artificial intelligence (XAI) methods have been proposed, little research has focused on the agreement between these methods and expert clinical knowledge. This study applies current state-of-the-art explainability methods to clinical decision support algorithms developed for Electronic Medical Records (EMR) data to analyse the concordance between these factors and discusses causes for identified discrepancies from a clinical and technical perspective. Important factors for achieving trustworthy XAI solutions for clinical decision support are also discussed.
Harmonic Decompositions of Convolutional Networks
Scetbon, Meyer, Harchaoui, Zaid
The renewed interest in convolutional neural networks [12, 15] in computer vision and signal processing has lead to a major leap in generalization performance on common task benchmarks, supported by the recent advances in graphical processing hardware and the collection of huge labelled datasets for training and evaluation. Convolutional neural networks pose major a challenge to statistical learning theory. First and foremost a convolutional network learns from data, jointly, both a feature representation through its hidden layers and a prediction function through its ultimate layer. A convolutional neural network implements a function unfolding as a composition of basic functions (respectively nonlinearity, convolution, and pooling), which appear to model well visual information in images. Yet the relevant function spaces to analyze their statistical performance remain unclear. The analysis of convolutional neural networks (CNNs) has been an active research topic. Different viewpoints have been developed. A straightforward viewpoint is to dismiss completely the grid-or latticestructure of images and analyze a multi-layer perceptron (MLP) instead acting on vectorized images, which has the downside the set aside the most interesting property CNNs which is to model well images that is data with a 2D lattice structure.
Toward Interpretable Machine Learning: Transparent Deep Neural Networks and Beyond
Samek, Wojciech, Montavon, Grégoire, Lapuschkin, Sebastian, Anders, Christopher J., Müller, Klaus-Robert
With the broader and highly successful usage of machine learning in industry and the sciences, there has been a growing demand for explainable AI. Interpretability and explanation methods for gaining a better understanding about the problem solving abilities and strategies of nonlinear Machine Learning such as Deep Learning (DL), LSTMs, and kernel methods are therefore receiving increased attention. In this work we aim to (1) provide a timely overview of this active emerging field and explain its theoretical foundations, (2) put interpretability algorithms to a test both from a theory and comparative evaluation perspective using extensive simulations, (3) outline best practice aspects i.e. how to best include interpretation methods into the standard usage of machine learning and (4) demonstrate successful usage of explainable AI in a representative selection of application scenarios. Finally, we discuss challenges and possible future directions of this exciting foundational field of machine learning.
Advances on interpretability of deep Neural Nets at ICIAM 2019
An introduction to different methods for Interpretability can be found here. During the ICIAM Theoretical advances of deep learning mini-symposia, there were some talks on interpretability, perhaps the most interesting ones were by Wojciech Samek, Fraunhofer Heinrich Hertz Institute, and by Stephan Waeldchen, Technische Universität Berlin. The first talk debated how LRP can be understood as a deep Taylor decomposition of the prediction. Some more information and tutorials on these can be found on their webpage. One of the methods to study the interpretability of a net is sensitivity analysis. For this, the changes of the gradient are used to decompose the neural net, however, the gradient is unreliable.
Advances on interpretability of deep Neural Nets at ICIAM 2019
An introduction to different methods for Interpretability can be found here. During the ICIAM Theoretical advances of deep learning mini-symposia, there were some talks on interpretability, perhaps the most interesting ones were by Wojciech Samek, Fraunhofer Heinrich Hertz Institute, and by Stephan Waeldchen, Technische Universität Berlin. The first talk debated how LRP can be understood as a deep Taylor decomposition of the prediction. Some more information and tutorials on these can be found on their webpage. One of the methods to study the interpretability of a net is sensitivity analysis. For this, the changes of the gradient are used to decompose the neural net, however, the gradient is unreliable.
Discriminating Spatial and Temporal Relevance in Deep Taylor Decompositions for Explainable Activity Recognition
Hiley, Liam, Preece, Alun, Hicks, Yulia, Marshall, David, Taylor, Harrison
Current techniques for explainable AI have been applied with some success to image processing. The recent rise of research in video processing has called for similar work n deconstructing and explaining spatio-temporal models. While many techniques are designed for 2D convolutional models, others are inherently applicable to any input domain. One such body of work, deep Taylor decomposition, propagates relevance from the model output distributively onto its input and thus is not restricted to image processing models. However, by exploiting a simple technique that removes motion information, we show that it is not the case that this technique is effective as-is for representing relevance in non-image tasks. We instead propose a discriminative method that produces a na\"ive representation of both the spatial and temporal relevance of a frame as two separate objects. This new discriminative relevance model exposes relevance in the frame attributed to motion, that was previously ambiguous in the original explanation. We observe the effectiveness of this technique on a range of samples from the UCF-101 action recognition dataset, two of which are demonstrated in this paper.
Explaining a prediction in some nonlinear models
In this article we will analyse how to compute the contribution of each input value to its aggregate output in some nonlinear models. Regression and classification applications, together with related algorithms for deep neural networks are presented. The proposed approach merges two methods currently present in the literature: integrated gradient and deep Taylor decomposition.
Towards Explaining Anomalies: A Deep Taylor Decomposition of One-Class Models
Kauffmann, Jacob, Müller, Klaus-Robert, Montavon, Grégoire
One such application is intrusion detection in computer systems, where data points are typically digital messages transmitted over a network, and messages that are detected as outliers are considered likely to carry a threat [13, 17]. Another application is obstacle detection in autonomous car driving [18]. The ability to detect outliers is also important in scientific applications, where points detected as such are intrinsically more interesting than inliers, and should therefore be given more attention [59, 28]. A number of techniques can be used for outlier detection [12, 21, 36, 41, 51]. In practice, it is not only important to be able to detect outliers and inliers with high accuracy, one would also like to be able to explain why a machine learning model considers a sample as inlier or outlier. An interpretable explanatory feedback can indeed be used by a human operator for appropriate decision making. The data point could either be considered as benign and possibly incorporated to the dataset, or appropriate action might be taken. The problem of outlier explanation is shown schematically in Figure 1.
Methods for Interpreting and Understanding Deep Neural Networks
Montavon, Grégoire, Samek, Wojciech, Müller, Klaus-Robert
This paper provides an entry point to the problem of interpreting a deep neural network model and explaining its predictions. It is based on a tutorial given at ICASSP 2017. It introduces some recently proposed techniques of interpretation, along with theory, tricks and recommendations, to make most efficient use of these techniques on real data. It also discusses a number of practical applications.
Explaining NonLinear Classification Decisions with Deep Taylor Decomposition
Montavon, Grégoire, Bach, Sebastian, Binder, Alexander, Samek, Wojciech, Müller, Klaus-Robert
Nonlinear methods such as Deep Neural Networks (DNNs) are the gold standard for various challenging machine learning problems, e.g., image classification, natural language processing or human action recognition. Although these methods perform impressively well, they have a significant disadvantage, the lack of transparency, limiting the interpretability of the solution and thus the scope of application in practice. Especially DNNs act as black boxes due to their multilayer nonlinear structure. In this paper we introduce a novel methodology for interpreting generic multilayer neural networks by decomposing the network classification decision into contributions of its input elements. Although our focus is on image classification, the method is applicable to a broad set of input data, learning tasks and network architectures. Our method is based on deep Taylor decomposition and efficiently utilizes the structure of the network by backpropagating the explanations from the output to the input layer. We evaluate the proposed method empirically on the MNIST and ILSVRC data sets.