6f518c31f6baa365f55c38d11cc349d1-AuthorFeedback.pdf

Neural Information Processing Systems 

Thetrajectories6 may bifurcate to take different paths to the goal, (as in BiMGame), but our method is able to efficiently learn the7 subgoals. Non-trivial tasks: BiMGame and AntTarget are37 non-trivial tasks as it fails without reward shaping.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found