Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration
