CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Patel, Sagar, Jyothi, Sangeetha Abdu, Narodytska, Nina

Dec-18-2023–arXiv.org Artificial Intelligence

We present CrystalBox, a novel, model-agnostic, posthoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. We further illustrate its higher utility across three practical use cases: contrastive explanations, network observability, and guided reward design, as opposed to prior explainability techniques that identify salient features.

controller, crystalbox, explanation, (17 more...)

arXiv.org Artificial Intelligence

Dec-18-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States > California
  - Santa Clara County > Palo Alto (0.04)
  - Orange County > Irvine (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.46)
  - Performance Analysis > Accuracy (0.31)