Hierarchical Deep Q-Network with Forgetting from Imperfect Demonstrations in Minecraft

Skrynnik, Alexey, Staroverov, Aleksey, Aitygulov, Ermek, Aksenov, Kirill, Davydov, Vasilii, Panov, Aleksandr I.

arXiv.org Artificial Intelligence 

We present hierarchical Deep Q-Network with Forgetting (HDQF) that took first place in MineRL competition. HDQF works on imperfect demonstrations utilize hierarchical structure of expert trajectories extracting effective sequence of meta-actions and subgoals. We introduce structured task dependent replay buffer and forgetting technique that allow the HDQF agent to gradually erase poor-quality expert data from the buffer. In this paper we present the details of the HDQF algorithm and give the experimental results in Minecraft domain.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found