Outcome-Driven Reinforcement Learning via Variational Inference
