Explainable Reinforcement Learning via a Causal World Model

Yu, Zhongwei, Ruan, Jingqing, Xing, Dengpeng

May-23-2023–arXiv.org Artificial Intelligence

Generating explanations for reinforcement learning (RL) is challenging because actions may have long-term effects on the future. In this paper, we develop a novel framework for explainable RL by learning a causal world model without prior knowledge of the causal structure of the environment. The model captures the influence of actions, allowing us to interpret their long-term effects through causal chains, which show how actions influence environmental variables and ultimately lead to rewards. Unlike most explanatory models, which suffer from low accuracy, our model remains accurate while improving explainability, making it applicable to model-based learning. As a result, we demonstrate that our causal model can serve as a bridge between explainability and learning.
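The notion of a causal chain can be illustrated with a small sketch. It is not the authors' implementation: the abstract describes learning the causal structure, whereas this example assumes the graph is already given, and the variable names (`position`, `fuel`, `distance_to_goal`) and the helper `causal_chains` are hypothetical. It only shows how directed paths from an action to the reward could be enumerated and read as explanations.

```python
from typing import Dict, List

# Hypothetical causal graph: each key directly influences the variables in its list.
# The variable names are invented for illustration and do not come from the paper.
CAUSAL_GRAPH: Dict[str, List[str]] = {
    "action": ["position"],
    "position": ["fuel", "distance_to_goal"],
    "fuel": ["reward"],
    "distance_to_goal": ["reward"],
    "reward": [],
}


def causal_chains(graph: Dict[str, List[str]], source: str, target: str) -> List[List[str]]:
    """Return every directed path from source to target (graph assumed acyclic)."""
    if source == target:
        return [[source]]
    chains: List[List[str]] = []
    for child in graph.get(source, []):
        for tail in causal_chains(graph, child, target):
            chains.append([source] + tail)
    return chains


chains = causal_chains(CAUSAL_GRAPH, "action", "reward")
for chain in chains:
    # Each printed line is one chain from the action to the reward.
    print(" -> ".join(chain))
```

Each printed path reads as one candidate explanation of how the action eventually affects the reward; in the framework described above, the graph itself would be learned from interaction data instead of being written by hand.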

causal chain, machine learning, reinforcement learning

Country:
- North America
  - United States
    - New York (0.04)
    - Maine (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Germany (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
    - West Sussex (0.04)
  - Switzerland > Vaud
    - Lausanne (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
- Asia
  - China (0.04)
  - South Korea > Seoul
    - Seoul (0.04)

Genre:
- Research Report (1.00)

Industry:
- Leisure & Entertainment > Games (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (0.92)
  - Cognitive Science > Problem Solving (0.70)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks (0.93)
