Domain-Invariant Per-Frame Feature Extraction for Cross-Domain Imitation Learning with Visual Observations
Kim, Minung, Lee, Kawon, Kim, Jungmo, Choi, Sungho, Han, Seungyul
–arXiv.org Artificial Intelligence
Imitation learning (IL) enables agents to mimic expert behavior without reward signals but faces challenges in cross-domain scenarios with high-dimensional, noisy, and incomplete visual observations. To address this, we propose Domain-Invariant Per-Frame Feature Extraction for Imitation Learning (DIFF-IL), a novel IL method that extracts domain-invariant features from individual frames and adapts them into sequences to isolate and replicate expert behaviors. We also introduce a frame-wise time labeling technique to segment expert behaviors by timesteps and assign rewards aligned with temporal contexts, enhancing task performance. Experiments across diverse visual environments demonstrate the effectiveness of DIFF-IL in addressing complex visual tasks.
arXiv.org Artificial Intelligence
Feb-14-2025
- Country:
- Asia > South Korea (0.28)
- North America > United States (0.46)
- Genre:
- Research Report > New Finding (0.46)
- Technology: