Generalization to New Actions in Reinforcement Learning
Jain, Ayush, Szot, Andrew, Lim, Joseph J.
–arXiv.org Artificial Intelligence
A fundamental trait of intelligence is the ability to achieve goals in the face of novel circumstances, such as making decisions from new action choices. However, standard reinforcement learning assumes a fixed set of actions and requires expensive retraining when given a new action set. To make learning agents more adaptable, we introduce the problem of zero-shot generalization to new actions. We propose a two-stage framework where the agent first infers action representations from action information acquired separately from the task. A policy flexible to varying action sets is then trained with generalization objectives. We benchmark generalization on sequential tasks, such as selecting from an unseen tool-set to solve physical reasoning puzzles and stacking towers with novel 3D shapes. Videos and code are available at https://sites.google.com/view/action-generalization
arXiv.org Artificial Intelligence
Nov-3-2020
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- California > Los Angeles County > Long Beach (0.04)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (0.46)
- Technology: