Imitating Unknown Policies via Exploration

Gavenski, Nathan, Monteiro, Juarez, Granada, Roger, Meneguzzi, Felipe, Barros, Rodrigo C.

Aug-12-2020–arXiv.org Artificial Intelligence

Behavioral cloning is an imitation learning technique that teaches an agent how to behave through expert demonstrations. Recent approaches use self-supervision of fully-observable unlabeled snapshots of the states to decode state-pairs into actions. However, the iterative learning scheme from these techniques are prone to getting stuck into bad local minima. We address these limitations incorporating a two-phase model into the original framework, which learns from unlabeled observations via exploration, substantially improving traditional behavioral cloning by exploiting (i) a sampling mechanism to prevent bad local minima, (ii) a sampling mechanism to improve exploration, and (iii) self-attention modules to capture global features. The resulting technique outperforms the previous state-of-the-art in four different environments by a large margin.

agent, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Aug-12-2020

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - Rio Grande do Sul > Porto Alegre (0.04)
- Europe > Middle East
  - Malta (0.04)

Genre:
- Research Report > Promising Solution (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found