Hierarchical Deep Q-Network with Forgetting from Imperfect Demonstrations in Minecraft
Skrynnik, Alexey, Staroverov, Aleksey, Aitygulov, Ermek, Aksenov, Kirill, Davydov, Vasilii, Panov, Aleksandr I.
–arXiv.org Artificial Intelligence
We present hierarchical Deep Q-Network with Forgetting (HDQF) that took first place in MineRL competition. HDQF works on imperfect demonstrations utilize hierarchical structure of expert trajectories extracting effective sequence of meta-actions and subgoals. We introduce structured task dependent replay buffer and forgetting technique that allow the HDQF agent to gradually erase poor-quality expert data from the buffer. In this paper we present the details of the HDQF algorithm and give the experimental results in Minecraft domain.
arXiv.org Artificial Intelligence
Dec-18-2019
- Country:
- North America > Canada (0.04)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.06)
- United Kingdom > England
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment > Games > Computer Games (1.00)
- Technology: