Strictly Batch Imitation Learning by Energy-based Distribution Matching

Daniel Jarrett (University of Cambridge), Ioana Bica (University of Oxford), Mihaela van der Schaar (University of Cambridge)

Neural Information Processing Systems 

We argue that a good solution should be able to explicitly parameterize a policy (i.e.
