Active Imitation Learning of Hierarchical Policies

Hamidi, Mandana (Oregon State University) | Tadepalli, Prasad (Oregon State University) | Goetschalckx, Robby (Oregon State University) | Fern, Alan (Oregon State University)

Jul-15-2015–AAAI Conferences

However, by being autonomous, structure of the policy, which is often critical for understanding these approaches have the problem of discovering the demonstration, is unobserved. We unnatural hierarchies, which may be difficult to interpret and formulate this problem as active learning of Probabilistic communicate to people. State-Dependent Grammars (PSDGs) from In this paper, we study the problem of learning policies demonstrations. Given a set of expert demonstrations, with hierarchical structure from demonstrations of a teacher our approach learns a hierarchical policy by whose policy is structured hierarchically, with natural applications actively selecting demonstrations and using queries to problems such as tutoring arithmetic, cooking, and to explicate their intentional structure at selected furniture assembly. A key challenge in this problem is that the points. Our contributions include a new algorithm demonstrations do not reveal the hierarchical task structure of for imitation learning of hierarchical policies and the teacher. Rather, only ground states and teacher actions are principled heuristics for the selection of demonstrations directly observable. This can lead to significant ambiguity in and queries.

hierarchical policy, query, trajectory, (15 more...)

AAAI Conferences

Jul-15-2015

Conferences PDF

Add feedback

Country:
- North America > United States
  - Oregon > Benton County
    - Corvallis (0.04)
  - California > San Francisco County
    - San Francisco (0.14)
- Europe > Finland
  - Uusimaa > Helsinki (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Grammars & Parsing (0.53)
  - Machine Learning
    - Inductive Learning (0.68)
    - Supervised Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found