MEME: Generating RNN Model Explanations via Model Extraction

Kazhdan, Dmitry, Dimanov, Botty, Jamnik, Mateja, Liò, Pietro

Dec-12-2020–arXiv.org Artificial Intelligence

Recurrent Neural Networks (RNNs) have achieved remarkable performance on a range of tasks. A key step to further empowering RNN-based approaches is improving their explainability and interpretability. In this work we present MEME: a model extraction approach capable of approximating RNNs with interpretable models represented by human-understandable concepts and their interactions. We demonstrate how MEME can be applied to two multivariate, continuous data case studies: Room Occupation Prediction, and In-Hospital Mortality Prediction. Using these case-studies, we show how our extracted models can be used to interpret RNNs both locally and globally, by approximating RNN decision-making via interpretable concept interactions.

explanation, rnn, transition function, (14 more...)

arXiv.org Artificial Intelligence

Dec-12-2020

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (1.00)
    - Machine Learning
      - Neural Networks > Deep Learning (0.89)
      - Learning Graphical Models > Undirected Networks
        Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found