Dyna-AIL : Adversarial Imitation Learning by Planning

Saxena, Vaibhav, Sivanandan, Srinivasan, Mathur, Pulkit

Mar-7-2019–arXiv.org Artificial Intelligence

Adversarial methods for imitation learning have been shown to perform well on various control tasks. However, they require a large number of environment interactions for convergence. In this paper, we propose an end-to-end differentiable adversarial imitation learning algorithm in a Dyna-like framework for switching between model-based planning and model-free learning from expert data. Our results on both discrete and continuous environments show that our approach of using model-based planning along with model-free learning converges to an optimal policy with fewer number of environment interactions in comparison to the state-of-the-art learning methods.

artificial intelligence, neural network, trajectory, (20 more...)

arXiv.org Artificial Intelligence

Mar-7-2019

arXiv.org PDF

Add feedback

Country:
- North America > Canada > Ontario > Toronto (0.31)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (0.69)
  - Reinforcement Learning (0.47)
  - Statistical Learning > Gradient Descent (0.30)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found