GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations
Huber, Tobias, Demmler, Maximilian, Mertes, Silvan, Olson, Matthew L., André, Elisabeth
–arXiv.org Artificial Intelligence
Counterfactual explanations are a common tool to explain artificial intelligence models. For Reinforcement Learning (RL) agents, they answer "Why not?" or "What if?" questions by illustrating what minimal change to a state is needed such that an agent chooses a different action. Generating counterfactual explanations for RL agents with visual input is especially challenging because of their large state spaces and because their decisions are part of an overarching policy, which includes long-term decision-making. However, research focusing on counterfactual explanations, specifically for RL agents with visual input, is scarce and does not go beyond identifying defective agents. It is unclear whether counterfactual explanations are still helpful for more complex tasks like analyzing the learned strategies of different agents or choosing a fitting agent for a specific task. We propose a novel but simple method to generate counterfactual explanations for RL agents by formulating the problem as a domain transfer problem which allows the use of adversarial learning techniques like StarGAN. Our method is fully model-agnostic and we demonstrate that it outperforms the only previous method in several computational metrics. Furthermore, we show in a user study that our method performs best when analyzing which strategies different agents pursue.
arXiv.org Artificial Intelligence
Feb-24-2023
- Country:
- North America
- United States
- Oregon > Benton County
- Corvallis (0.04)
- New York
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- New York County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Oregon > Benton County
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- Germany (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Basque Country
- Biscay Province > Bilbao (0.04)
- Slovenia > Drava
- Municipality of Benedikt > Benedikt (0.04)
- Netherlands > North Brabant
- Eindhoven (0.04)
- Asia > Middle East
- Jordan (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America
- Genre:
- Questionnaire & Opinion Survey (0.89)
- Research Report
- Experimental Study (0.93)
- New Finding (0.67)
- Industry:
- Health & Medicine (0.46)
- Technology: