Meta-Learning Parameterized Skills

Fu, Haotian, Yu, Shangqun, Tiwari, Saket, Littman, Michael, Konidaris, George

Jul-19-2023–arXiv.org Artificial Intelligence

We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL combined with a trajectory-centric smoothness term to learn a set of parameterized skills. Our agent can use these learned skills to construct a three-level hierarchical framework that models a Temporally-extended Parameterized Action Markov Decision Process. We empirically demonstrate that the proposed algorithms enable an agent to solve a set of difficult long-horizon (obstacle-course and robot manipulation) tasks.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

Jul-19-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - Puerto Rico (0.04)
  - United States
    - Oregon (0.04)
    - Maryland > Baltimore (0.04)
    - Pennsylvania > Philadelphia County
      - Philadelphia (0.04)
    - Massachusetts
      - Middlesex County > Cambridge (0.04)
      - Hampshire County > Amherst (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California > Santa Clara County
      - Stanford (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Austria (0.04)
  - Portugal (0.04)
  - Czechia > Prague (0.04)
  - United Kingdom > Scotland
    - City of Edinburgh > Edinburgh (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia
  - Japan > Honshū
    - Kansai > Osaka Prefecture
      - Osaka (0.04)
    - Chūbu > Toyama Prefecture
      - Toyama (0.04)
  - China > Shanghai
    - Shanghai (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report (0.82)

Industry:
- Education (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found