Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks

David Klee, Ondrej Biza, Robert Platt

Jul-22-2022, arXiv.org Artificial Intelligence

Multi-goal policy learning for robotic manipulation is challenging. Prior successes have used state-based representations of the objects or provided demonstration data to facilitate learning. In this paper, by hand-coding a high-level discrete representation of the domain, we show that policies to reach dozens of goals can be learned with a single network using Q-learning from pixels. The agent focuses learning on simpler, local policies which are sequenced together by planning in the abstract space. We compare our method against standard multi-goal RL baselines, as well as other methods that leverage the discrete representation, on a challenging block construction domain. We find that our method can build more than a hundred different block structures, and demonstrate forward transfer to structures with novel objects. Lastly, we deploy the policy learned in simulation on a real robot.
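The abstract's idea of sequencing local policies by planning in the abstract space can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the hand-coded discrete representation can be written as a directed graph whose nodes are abstract block structures and whose edges are single-step transitions that one local policy is expected to achieve. The state names, `ABSTRACT_GRAPH`, `plan_subgoals`, and `subgoal_pairs` are all hypothetical, and breadth-first search stands in for whatever planner the paper uses.

```python
from collections import deque

# Hypothetical abstract graph: each key is an abstract structure, each value
# lists the structures reachable by one local manipulation step (assumption).
ABSTRACT_GRAPH = {
    "empty": ["one_block"],
    "one_block": ["two_stack", "two_row"],
    "two_stack": ["three_tower"],
    "two_row": ["bridge"],
    "three_tower": [],
    "bridge": [],
}


def plan_subgoals(graph, start, goal):
    """Breadth-first search over the abstract graph.

    Returns the list of abstract states from start to goal (inclusive),
    or None if the goal cannot be reached.
    """
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk parent pointers back to the start, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for nxt in graph.get(node, ()):
            if nxt not in parents:
                parents[nxt] = node
                queue.append(nxt)
    return None


def subgoal_pairs(path):
    """Split a plan into (current, next) pairs, one per local policy call."""
    return list(zip(path, path[1:]))


if __name__ == "__main__":
    plan = plan_subgoals(ABSTRACT_GRAPH, "empty", "bridge")
    for current, target in subgoal_pairs(plan):
        # In a full system, a goal-conditioned Q-network would act from
        # pixels here until the abstract state changes to `target`.
        print(f"local policy: {current} -> {target}")
```

Under these assumptions, the learned network only ever has to solve the short transitions returned by `subgoal_pairs`, while reaching a distant structure is left to the graph search.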
