Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion

Kolter, J. Z., Abbeel, Pieter, Ng, Andrew Y.

Dec-31-2008–Neural Information Processing Systems

We consider apprenticeship learning--learning from expert demonstrations--in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert demonstrate complete trajectories through the domain. However, in many problems even an expert has difficulty controlling the system, which makes this approach infeasible. For example, consider the task of teaching aquadruped robot to navigate over extreme terrain; demonstrating an optimal policy (i.e., an optimal set of foot locations over the entire terrain) is a highly nontrivial task, even for an expert. In this paper we propose a method for hierarchical apprenticeshiplearning, which allows the algorithm to accept isolated advice at different hierarchical levels of the control task. This type of advice is often feasible for experts to give, even if the expert is unable to demonstrate complete trajectories.This allows us to extend the apprenticeship learning paradigm to much larger, more challenging domains. In particular, in this paper we apply the hierarchical apprenticeship learning algorithm to the task of quadruped locomotion overextreme terrain, and achieve, to the best of our knowledge, results superior to any previously published work.

apprenticeship, constraint, terrain, (14 more...)

Neural Information Processing Systems

Dec-31-2008

Conferences PDF

Add feedback

Country:
- North America > United States > California > Santa Clara County
  - Palo Alto (0.04)
  - Stanford (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion

Similar Docs Excel Report more

Title	Similarity	Source
None found