Interpreting Neural Policies with Disentangled Tree Representations

Tsun-Hsuan Wang, Wei Xiao, Tim Seyde, Ramin Hasani, Daniela Rus

arXiv.org Machine Learning 

This lack of transparency, often referred to as the "black box" problem, makes it hard to interpret the workings of learning-based robot control systems. Understanding why a particular decision was made, or predicting how the system will behave in future scenarios, remains a challenge, yet it is critical for physical deployments. Through the lens of representation learning, we assume that neural networks capture a set of processes that exist in the data distribution; for robots, these manifest as learned skills, behaviors, or strategies, which are critical to understanding the decision-making of a policy. However, while such factors of variation [1] (e.g., color or shape representations) are actively studied in unsupervised learning for disentangled representation, in robot learning they are less well-defined and pose unique challenges due to the intertwined correspondence of neural activities with emergent behaviors unknown a priori. In the present study, we aim to (i) provide a useful definition of factors of variation for policy learning, and (ii) explore how to uncover dynamics and factors of variation quantitatively as a measure of interpretability in compact neural networks for closed-loop end-to-end control applications.
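To make the idea of quantifying factors of variation concrete, below is a minimal, hypothetical sketch (not the paper's actual method) of one common proxy: fitting a shallow decision tree from logged observations to each neuron's activation along policy rollouts, and treating neurons whose responses are well explained by a few simple rules as more interpretable. The function name, the synthetic rollout data, and the held-out R^2 score are illustrative assumptions.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import train_test_split


def tree_fit_score(observations, neuron_activations, max_depth=3):
    """Score how well a shallow decision tree over observations explains
    one neuron's activation (held-out R^2 on rollout data).

    A higher score suggests the neuron's response is captured by a few
    human-readable decision rules over the input features.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        observations, neuron_activations, test_size=0.25, random_state=0)
    tree = DecisionTreeRegressor(max_depth=max_depth, random_state=0)
    tree.fit(X_train, y_train)
    return tree.score(X_test, y_test), tree


# Synthetic stand-in data: 4 observation features, 8 policy neurons.
rng = np.random.default_rng(0)
obs = rng.normal(size=(2000, 4))               # logged observations
acts = np.tanh(obs @ rng.normal(size=(4, 8)))  # stand-in neuron responses

scores = [tree_fit_score(obs, acts[:, j])[0] for j in range(acts.shape[1])]
print("held-out R^2 per neuron:", np.round(scores, 3))
```

In practice one would replace the synthetic arrays with observations and hidden-unit activations recorded from closed-loop rollouts of the trained policy; the per-neuron scores then give a simple quantitative handle on how tree-explainable (and thus how interpretable, under this proxy) each unit's behavior is.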
