Optimal Decision Tree Policies for Markov Decision Processes

Jan-30-2023–arXiv.org Artificial Intelligence

Interpretability of reinforcement learning policies is essential for many real-world tasks but learning such interpretable policies is a hard problem. Particularly rule-based policies such as decision trees and rules lists are difficult to optimize due to their non-differentiability. While existing techniques can learn verifiable decision tree policies there is no guarantee that the learners generate a decision that performs optimally. In this work, we study the optimization of size-limited decision trees for Markov Decision Processes (MPDs) and propose OMDTs: Optimal MDP Decision Trees. Given a user-defined size limit and MDP formulation OMDT directly maximizes the expected discounted return for the decision tree using Mixed-Integer Linear Programming. By training optimal decision tree policies for different MDPs we empirically study the optimality gap for existing imitation learning techniques and find that they perform sub-optimally. We show that this is due to an inherent shortcoming of imitation learning, namely that complex policies cannot be represented using size-limited trees. In such cases, it is better to directly optimize the tree for expected return. While there is generally a trade-off between the performance and interpretability of machine learning models, we find that OMDTs limited to a depth of 3 often perform close to the optimal limit.

artificial intelligence, decision tree, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Jan-30-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.14)
- Europe > Netherlands
  - South Holland > Delft (0.04)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Diagnosis (1.00)
  - Machine Learning
    - Decision Tree Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.84)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found