Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

Scheller, Christian, Schraner, Yanick, Vogel, Manfred

Mar-12-2020–arXiv.org Machine Learning

Sample inefficiency of deep reinforcement learning methods is a major obstacle for their use in real-world applications. In this work, we show how human demonstrations can improve final performance of agents on the Minecraft minigame ObtainDiamond with only 8M frames of environment interaction. We propose a training procedure where policy networks are first trained on human data and later fine-tuned by reinforcement learning. Using a policy exploitation mechanism, experience replay and an additional loss against catastrophic forgetting, our best agent was able to achieve a mean score of 48. Our proposed solution placed 3 rd in the NeurIPS MineRL Competition for Sample-Efficient Reinforcement Learning.

demonstration, learning, sample efficient reinforcement learning, (11 more...)

arXiv.org Machine Learning

Mar-12-2020

arXiv.org PDF

Add feedback

Country:
- Europe > Switzerland (0.04)

Genre:
- Research Report (1.00)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.73)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found