Common Questions policies than is typical in imitation learning literature, they still need to have meaningful behaviors in order to provide

Neural Information Processing Systems 

We now see how the term "expert" can be misleading. In the revision, we will replace it with "oracle" We plan to include more results on harder domains. We will further clarify this in the revision. We will clarify and better motivate these equations. We will include an ablation study of UpdateInputWhitening in the revision.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found