Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization Julien Roy

Open in new window