Reviews: Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies

Neural Information Processing Systems 

The paper introduces an RL problem in which the agent must execute a given subtask graph describing a set of subtasks and their dependencies, and proposes a neural subtask graph solver (NSS) to solve it. NSS consists of an observation module that captures environment information with a CNN and a task module that encodes the subtask graph with a recursive-reverse-recursive neural network (R3NN). A non-parametric reward-propagation policy (RProp) is used to pre-train the NSS agent, which is then fine-tuned with an actor-critic method.

Overall, the problem introduced in this paper is interesting, and the combination of a CNN to capture the observation information with an R3NN to encode the subtask graph is a good idea.

Cons:
1. Writing: many details of the proposed method are deferred to the supplementary material, which makes the paper difficult to follow from the main text alone.
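For concreteness, the recursive-reverse-recursive encoding of the subtask graph can be pictured as a bottom-up pass followed by a top-down pass over the dependency DAG. The following is a toy numpy sketch, not the authors' implementation: the graph, dimensions, weights, and aggregation rule are all made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical subtask DAG: an edge (s, t) means subtask s is a
# precondition of subtask t. Nodes 0..3 are in topological order.
edges = [(0, 2), (1, 2), (2, 3)]
n, d = 4, 8

parents = {i: [s for s, t in edges if t == i] for i in range(n)}
children = {i: [t for s, t in edges if s == i] for i in range(n)}

W_up = rng.normal(scale=0.1, size=(d, d))    # made-up weights
W_down = rng.normal(scale=0.1, size=(d, d))
x = rng.normal(size=(n, d))                  # per-subtask input features

# Bottom-up (recursive) pass: each node aggregates the embeddings
# of its preconditions before computing its own embedding.
up = np.zeros((n, d))
for i in range(n):
    agg = sum((up[p] for p in parents[i]), np.zeros(d))
    up[i] = np.tanh(x[i] + agg @ W_up)

# Top-down (reverse-recursive) pass: context flows back from the
# subtasks that depend on each node.
down = np.zeros((n, d))
for i in reversed(range(n)):
    agg = sum((down[c] for c in children[i]), np.zeros(d))
    down[i] = np.tanh(up[i] + agg @ W_down)

# Final per-subtask embedding combines both directions; a policy
# head would score subtasks from these embeddings.
emb = np.concatenate([up, down], axis=1)
print(emb.shape)
```

The two-pass structure is what lets each subtask's embedding reflect both its preconditions and the subtasks it unlocks.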