Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning

Jun-16-2026, 21:14:09 GMT–Neural Information Processing Systems

Imitation learning (IL) is a paradigm for learning sequential decision-making policies from experts, using offline demonstrations, interactive annotations, or both. Recent work shows that when annotation cost is tallied per trajectory, Behavior Cloning (BC), which relies solely on offline demonstrations, cannot be improved upon in general, so interactive methods such as DAgger help only under limited conditions. We revisit this conclusion and prove that when annotation cost is instead measured per state, algorithms using interactive annotations can provably outperform BC. Specifically: (1) we show that STAGGER, a one-sample-per-round variant of DAgger, provably beats BC in low-recovery-cost settings; (2) we initiate the study of hybrid IL, where the agent learns from both offline demonstrations and interactive annotations. We propose WARM-STAGGER, whose learning guarantee is not much worse than that of using either data source alone.
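To make the per-state cost model concrete, here is a minimal sketch of a one-sample-per-round interactive loop in the spirit of DAgger. It is not the paper's STAGGER algorithm, whose details the abstract does not give. The toy ring environment, the expert rule, the tabular learner, and every name in the code are assumptions made for illustration only. The point it shows is the accounting: each round rolls out the learner's current policy, queries the expert on a single visited state, and therefore spends exactly one annotation.

```python
import random

N_STATES = 6   # hypothetical toy ring of states
HORIZON = 6    # steps per rollout


def expert_action(state):
    """Hypothetical expert: action 1 on even states, 0 on odd states."""
    return 1 if state % 2 == 0 else 0


def step(state, action):
    """Toy deterministic dynamics: advance 1 or 2 positions on the ring."""
    return (state + 1 + action) % N_STATES


def rollout(policy, start=0):
    """Return the states visited when following `policy` for HORIZON steps."""
    states, state = [], start
    for _ in range(HORIZON):
        states.append(state)
        state = step(state, policy.get(state, 0))
    return states


def one_sample_per_round(rounds, seed=0):
    """Illustrative loop: one expert annotation per round (an assumption,
    not the paper's procedure)."""
    rng = random.Random(seed)
    policy = {}    # tabular learner; unlabeled states default to action 0
    dataset = []   # (state, expert label) pairs, one per round
    for _ in range(rounds):
        visited = rollout(policy)         # states from the learner's own rollout
        queried = rng.choice(visited)     # pick a single state to annotate
        label = expert_action(queried)    # the only expert query this round
        dataset.append((queried, label))
        policy[queried] = label           # expert is deterministic, so memorize
    return policy, dataset


policy, dataset = one_sample_per_round(rounds=50)
print(len(dataset), "state-level annotations used")
```

Under per-state accounting, this loop's cost after 50 rounds is 50 annotations, whereas labeling every state of 50 offline trajectories of the same horizon would cost 300.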

annotation, machine learning, reinforcement learning

Country:
- North America > United States (0.46)

Genre:
- Workflow (0.93)
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.67)

Industry:
- Education (0.46)
- Information Technology (0.46)
- Automobiles & Trucks (0.46)
- Transportation > Ground
  - Road (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning
    - Learning Graphical Models (0.93)
    - Neural Networks (0.92)
    - Reinforcement Learning (0.88)
