Learning Retrospective Knowledge with Reverse Reinforcement Learning

Shangtong Zhang, University of Oxford
Vivek Veeriah, University of Michigan, Ann Arbor
Shimon Whiteson, University of Oxford
