AdversarialSoftAdvantageFitting: ImitationLearningwithoutPolicyOptimization

Neural Information Processing Systems 

When optimized, this discriminator directly learns the optimal generator'spolicy.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found