Reviews: Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning
–Neural Information Processing Systems
Post rebuttal: I now understand the middle ground in which this paper is positioned, and its difference from propositional OO representations, where you do not necessarily care which instance of an object type you are dealing with; this significantly reduces the dimensionality of learning transition dynamics. However, this is still similar to other work on graph neural networks for model learning with fully relational representations, such as Relation Networks by Santoro et al. and Interaction Networks by Battaglia et al., which in the worst case learn T * n * (n-1) relations for n objects and T types of relations. That said, this paper does a nice job of formalizing the approach starting from the OO-MDP and propositional MDP settings, whereas the two papers I mentioned do not, and focus on the physical dynamics case. I am willing to increase my score based on this, but I still do not think the work is novel enough to be accepted. It is also very similar to relational MDPs, although here transition dynamics are learned in the relational attribute space rather than the raw state space.
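As a quick sanity check of the worst-case count cited above (this helper is my own illustration, not code from any of the papers): a fully relational model that considers every ordered pair of distinct objects under every relation type must handle T * n * (n-1) relations.

```python
def worst_case_relations(n: int, t: int) -> int:
    """Worst-case relation count for a fully relational model:
    t relation types over all ordered pairs of n distinct objects."""
    return t * n * (n - 1)

# e.g. 10 objects and 3 relation types: 3 * 10 * 9 = 270 relations
print(worst_case_relations(10, 3))  # → 270
```

This quadratic growth in n is exactly what a deictic or propositional OO representation can avoid when object identity does not matter.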
Oct-8-2024, 06:56:52 GMT