Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble

Open in new window