Modelling Agent Policies with Interpretable Imitation Learning

Bewley, Tom, Lawry, Jonathan, Richards, Arthur

Jun-19-2020–arXiv.org Artificial Intelligence

As we deploy autonomous agents in safety-critical domains, it becomes important to develop an understanding of their internal mechanisms and representations. We outline an approach to imitation learning for reverse-engineering black box agent policies in MDP environments, yielding simplified, interpretable models in the form of decision trees. As part of this process, we explicitly model and learn agents' latent state representations by selecting from a large space of candidate features constructed from the Markov state.

artificial intelligence, decision tree learning, representation, (18 more...)

arXiv.org Artificial Intelligence

Jun-19-2020

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.65)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Decision Tree Learning (0.93)
  - Representation & Reasoning > Agents (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found