Meta-Learning Parameterized Skills
Fu, Haotian, Yu, Shangqun, Tiwari, Saket, Littman, Michael, Konidaris, George
–arXiv.org Artificial Intelligence
We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL combined with a trajectory-centric smoothness term to learn a set of parameterized skills. Our agent can use these learned skills to construct a three-level hierarchical framework that models a Temporally-extended Parameterized Action Markov Decision Process. We empirically demonstrate that the proposed algorithms enable an agent to solve a set of difficult long-horizon (obstacle-course and robot manipulation) tasks.
arXiv.org Artificial Intelligence
Jul-19-2023
- Country:
- North America
- Puerto Rico (0.04)
- United States
- Oregon (0.04)
- Maryland > Baltimore (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Hampshire County > Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > Santa Clara County
- Stanford (0.04)
- Canada > British Columbia
- Europe
- Austria (0.04)
- Portugal (0.04)
- Czechia > Prague (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- France > Île-de-France
- Asia
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Education (0.48)
- Technology: