Outcome-Driven Reinforcement Learning via Variational Inference

Tim G. J. Rudner (University of Oxford), Vitchyr H. Pong

Neural Information Processing Systems 

Illustration of the shaping effect of the reward function derived from the goal-directed variational inference objective.
