Hierarchical Few-Shot Imitation with Skill Transition Models

Hakhamaneshi, Kourosh, Zhao, Ruihan, Zhan, Albert, Abbeel, Pieter, Laskin, Michael

Jul-19-2021–arXiv.org Artificial Intelligence

A desirable property of autonomous agents is the ability to both solve long-horizon problems and generalize to unseen tasks. Recent advances in data-driven skill learning have shown that extracting behavioral priors from offline data can enable agents to solve challenging long-horizon tasks with reinforcement learning. However, generalization to tasks unseen during behavioral prior training remains an outstanding challenge. To this end, we present Few-shot Imitation with Skill Transition Models (FIST), an algorithm that extracts skills from offline data and utilizes them to generalize to unseen tasks given a few downstream demonstrations. FIST learns an inverse skill dynamics model, a distance function, and utilizes a semi-parametric approach for imitation. We show that FIST is capable of generalizing to new tasks and substantially outperforms prior baselines in navigation experiments requiring traversing unseen parts of a large maze and 7-DoF robotic arm experiments requiring manipulating previously unseen objects in a kitchen.

dataset, demonstration, imitation, (14 more...)

arXiv.org Artificial Intelligence

Jul-19-2021

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Queensland > Brisbane (0.04)
- North America > United States
  - Florida > Broward County
    - Fort Lauderdale (0.04)
  - Colorado > Denver County
    - Denver (0.04)
  - California
    - Los Angeles County > Long Beach (0.04)
    - Alameda County > Berkeley (0.04)
    - Santa Clara County
      - Stanford (0.04)
      - Mountain View (0.04)
- Europe > Spain
  - Catalonia > Barcelona Province > Barcelona (0.04)

Genre:
- Research Report (1.00)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (0.90)