Reviews: Compositional Plan Vectors

Jan-21-2025, 04:01:25 GMT–Neural Information Processing Systems

Summary The paper proposes a new method for better and more efficient generalization to more complex tasks at test time in the setting of one-shot imitation learning. The main idea is to condition the policy on the difference between the embedding of some reference trajectory and the a partial trajectory of the agent (for the same task, but starting from a potentially different state of the environment). Main Comments I found the experimental section to be slightly thin and I would like to see how this method performs on at least another more complex task. It would also be good to include a discussion on the types of environments where we can expect this to perform best and where we can expect it to fail or perform worse than other relevant algorithms. I also think more comparisons with other approaches for one-shot imitation learning (such as Duan et al. 2017) are needed for strengthening the paper.

compositional plan vector, generalization, one-shot imitation learning, (5 more...)

Neural Information Processing Systems

Jan-21-2025, 04:01:25 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.52)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning (1.00)