Meta-InverseReinforcementLearningwith ProbabilisticContextVariables

Neural Information Processing Systems 

Tothis end,wepropose adeep latent variable model thatiscapable oflearning rewards from demonstrations of distinct but related tasks in an unsupervised way.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found