Imitation Learning by Coaching
He, He, Eisner, Jason, Daume, Hal
–Neural Information Processing Systems
Imitation Learning has been shown to be successful in solving many challenging real-world problems. Some recent approaches give strong performance guarantees by training the policy iteratively. However, it is important to note that these guarantees depend on how well the policy we found can imitate the oracle on the training data. When there is a substantial difference between the oracle's ability and the learner's policy space, we may fail to find a policy that has low error on the training set. In such cases, we propose to use a coach that demonstrates easy-to-learn actions for the learner and gradually approaches the oracle.
Neural Information Processing Systems
Feb-15-2020, 00:27:29 GMT
- Technology: