Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan
–Neural Information Processing Systems
Incontrast toLearning fromDemonstration (LfD) that involves both action and state supervision, LfO is more practical in leveraging previously inapplicable resources (e.g.
Neural Information Processing Systems
Feb-14-2026, 23:36:59 GMT