Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals
Mu, Tongzhou, Gu, Jiayuan, Jia, Zhiwei, Tang, Hao, Su, Hao
–arXiv.org Artificial Intelligence
We study how to learn a policy with compositional generalizability. We propose a two-stage framework, which refactorizes a high-reward teacher policy into a generalizable student policy with strong inductive bias. Particularly, we implement an object-centric GNN-based student policy, whose input objects are learned from images through self-supervised learning. Empirically, we evaluate our approach on four difficult tasks that require compositional generalizability, and achieve superior performance compared to baselines.
arXiv.org Artificial Intelligence
Oct-26-2020
- Country:
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.68)
- Leisure & Entertainment > Games (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.46)
- Reinforcement Learning (0.94)
- Statistical Learning (0.68)
- Representation & Reasoning (1.00)
- Vision (0.93)
- Machine Learning
- Information Technology > Artificial Intelligence