layer resolution stride norm
Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.46)
Country:
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > Canada (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals
Mu, Tongzhou, Gu, Jiayuan, Jia, Zhiwei, Tang, Hao, Su, Hao
We study how to learn a policy with compositional generalizability. We propose a two-stage framework, which refactorizes a high-reward teacher policy into a generalizable student policy with strong inductive bias. Particularly, we implement an object-centric GNN-based student policy, whose input objects are learned from images through self-supervised learning. Empirically, we evaluate our approach on four difficult tasks that require compositional generalizability, and achieve superior performance compared to baselines.
2011.00971
Country:
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
Industry:
- Education (0.68)
- Leisure & Entertainment > Games (0.46)
Technology: