Active Imitation Learning via Reduction to I.I.D. Active Learning

Judah, Kshitij (Oregon State University) | Fern, Alan Paul (Oregon State University) | Dietterich, Thomas Glenn (Oregon State University)

Nov-5-2012–AAAI Conferences

In standard passive imitation learning, the goal is to learn an expert’s policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider Active Imitation Learning (AIL) with the goal of reducing this effort by querying the expert about the desired action at individual states, which are selected based on answers to past queries and the learner’s interactions with an environment simulator. Our new approach is based on reducing AIL to i.i.d. active learning, which can leverage progress in the i.i.d. setting. We introduce and analyze reductions for both non-stationary and stationary policies, showing that the label complexity (number of queries) of AIL can be substantially less than passive learning. We also introduce a practical algorithm inspired by the reductions, which is shown to be highly effective in four test domains compared to a number of alternatives.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

AAAI Conferences

Nov-5-2012

Conferences PDF

Add feedback

Country:
- North America > United States
  - Massachusetts (0.04)
  - Wisconsin > Dane County
    - Madison (0.04)
  - Oregon > Benton County
    - Corvallis (0.04)

Industry:
- Education (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.46)
  - Statistical Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found