Object-Oriented Dynamics Predictor

Zhu, Guangxiang, Huang, Zhiao, Zhang, Chongjie

Feb-14-2020, 20:56:14 GMT–Neural Information Processing Systems

Generalization has been one of the major challenges for learning dynamics models in model-based reinforcement learning. However, previous work on action-conditioned dynamics prediction focuses on learning the pixel-level motion and thus does not generalize well to novel environments with different object layouts. In this paper, we present a novel object-oriented framework, called object-oriented dynamics predictor (OODP), which decomposes the environment into objects and predicts the dynamics of objects conditioned on both actions and object-to-object relations. It is an end-to-end neural network and can be trained in an unsupervised manner. To enable the generalization ability of dynamics learning, we design a novel CNN-based relation mechanism that is class-specific (rather than object-specific) and exploits the locality principle.

dynamic model, novel environment, object-oriented dynamic predictor, (3 more...)

Neural Information Processing Systems

Feb-14-2020, 20:56:14 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Software > Programming Languages (0.90)
  - Artificial Intelligence
    - Representation & Reasoning > Object-Oriented Architecture (1.00)
    - Machine Learning (0.88)