Domain-Invariant Per-Frame Feature Extraction for Cross-Domain Imitation Learning with Visual Observations

Minung Kim, Kawon Lee, Jungmo Kim, Sungho Choi, Seungyul Han

arXiv.org Artificial Intelligence 

Imitation learning (IL) enables agents to mimic expert behavior without reward signals, but it struggles in cross-domain scenarios with high-dimensional, noisy, and incomplete visual observations. To address this, we propose Domain-Invariant Per-Frame Feature Extraction for Imitation Learning (DIFF-IL), a novel IL method that extracts domain-invariant features from individual frames and adapts them into sequences to isolate and replicate expert behaviors. We also introduce a frame-wise time labeling technique that segments expert behaviors by timestep and assigns rewards aligned with temporal context, enhancing task performance. Experiments across diverse visual environments demonstrate the effectiveness of DIFF-IL on complex visual tasks.
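The abstract only sketches the method at a high level, so the following PyTorch snippet is an illustrative interpretation rather than the authors' implementation. It assumes (not confirmed by the abstract) that per-frame domain invariance is encouraged with a gradient-reversal domain discriminator and that frame-wise time labeling is a per-frame timestep classifier whose confidence can serve as a temporally aligned reward. All module names and hyperparameters are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GradientReversal(torch.autograd.Function):
    """Identity on the forward pass; reverses and scales gradients on the backward pass."""
    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None


class PerFrameEncoder(nn.Module):
    """Maps a single visual frame to a feature vector intended to be shared across domains."""
    def __init__(self, feat_dim: int = 128):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 32, 4, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.fc = nn.Linear(64, feat_dim)

    def forward(self, frame):                 # frame: (B, 3, H, W)
        return self.fc(self.conv(frame))      # (B, feat_dim)


class DiffILSketch(nn.Module):
    """Per-frame encoder + domain discriminator + frame-wise time classifier (hypothetical)."""
    def __init__(self, feat_dim: int = 128, horizon: int = 100, lam: float = 1.0):
        super().__init__()
        self.encoder = PerFrameEncoder(feat_dim)
        self.domain_head = nn.Linear(feat_dim, 2)      # source vs. target domain
        self.time_head = nn.Linear(feat_dim, horizon)  # which expert timestep the frame resembles
        self.lam = lam

    def losses(self, frame, domain_label, time_label):
        z = self.encoder(frame)
        # The domain loss flows through reversed gradients, pushing the encoder
        # toward features the domain discriminator cannot tell apart.
        z_rev = GradientReversal.apply(z, self.lam)
        domain_loss = F.cross_entropy(self.domain_head(z_rev), domain_label)
        time_loss = F.cross_entropy(self.time_head(z), time_label)
        return domain_loss + time_loss

    @torch.no_grad()
    def frame_reward(self, frame, t):
        """Reward proxy: probability that the frame looks like expert behavior at timestep t."""
        probs = F.softmax(self.time_head(self.encoder(frame)), dim=-1)
        return probs[torch.arange(frame.size(0)), t]
```

In this reading, the per-frame reward would be summed over a trajectory and used by a standard RL optimizer; the paper itself should be consulted for the actual objectives and architecture.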