Visual Adversarial Imitation Learning using Variational Models Rafael Rafailov 1 Tianhe Y u

Neural Information Processing Systems 

Reward function specification, which requires considerable human effort and iteration, remains a major impediment for learning behaviors through deep reinforcement learning.