CEIL: Generalized Contextual Imitation Learning Li He1 Zifeng Zhuang 1,2
–Neural Information Processing Systems
Inspired by the formulation of hindsight information matching, we derive CEIL by explicitly learning a hindsight embedding function together with a contextual policy using the hindsight embeddings. To achieve the expert matching objective for IL, we advocate for optimizing a contextual variable such that it biases the contextual policy towards mimicking expert behaviors. Beyond the typical learning from demonstrations (LfD) setting, CEIL is a generalist that can be effectively applied to multiple settings including: 1) learning from observations (LfO), 2) offline IL, 3) cross-domain IL (mismatched experts), and 4) one-shot IL settings.
Neural Information Processing Systems
May-25-2025, 15:57:28 GMT
- Country:
- North America > United States (0.14)
- Genre:
- Instructional Material > Course Syllabus & Notes (0.46)
- Research Report > New Finding (0.67)
- Industry:
- Education > Educational Setting > Online (0.46)
- Technology: