Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Paul Barde
