Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

Oct-11-2024, 04:37:54 GMT–Neural Information Processing Systems

Inspired by recent work in attention models for image captioning and question answering, we present a soft attention model for the reinforcement learning domain. This model bottlenecks the view of an agent by a soft, top-down attention mechanism, forcing the agent to focus on task-relevant information by sequentially querying its view of the environment. The output of the attention mechanism allows direct observation of the information used by the agent to select its actions, enabling easier interpretation of this model than of traditional models. We analyze the different strategies the agents learn and show that a handful of strategies arise repeatedly across different games. We also show that the model learns to query separately about space and content (where'' vs.what'').

attention augmented agent, attention model, interpretable reinforcement learning, (2 more...)

Neural Information Processing Systems

Oct-11-2024, 04:37:54 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)