Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Neural Information Processing Systems
To this end, we propose a deep latent variable model that is capable of learning rewards from demonstrations of distinct but related tasks in an unsupervised way.
Feb-11-2026, 20:12:16 GMT