Bridging the Imitation Gap by Adaptive Insubordination

Dec-24-2025, 14:50:54 GMT, Neural Information Processing Systems

In practice, imitation learning is preferred over pure reinforcement learning whenever it is possible to design a teaching agent to provide expert supervision. However, we show that when the teaching agent makes decisions with access to privileged information that is unavailable to the student, this information is marginalized during imitation learning, resulting in an imitation gap and, potentially, poor results.
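The marginalization effect can be seen in a toy setting. The sketch below is an illustrative assumption, not an experiment from the paper: a prize sits behind one of two doors, the expert sees which door (the privileged information) and always opens it, while the student observes nothing. Fitting the student by matching expert action frequencies, which is what a cross-entropy imitation loss converges to when the student's observation is constant, yields a roughly uniform policy. All names (`expert_action`, `fit_student`, `expected_success`) are hypothetical.

```python
import random
from collections import Counter

def expert_action(hidden_door: int) -> int:
    # The expert sees the privileged hidden state and always opens the right door.
    return hidden_door

def fit_student(num_samples: int, seed: int = 0) -> dict:
    # The student has no observation, so the best imitation fit is simply the
    # empirical frequency of expert actions: the hidden state is marginalized out.
    rng = random.Random(seed)
    counts = Counter(expert_action(rng.randrange(2)) for _ in range(num_samples))
    return {action: counts[action] / num_samples for action in (0, 1)}

def expected_success(policy: dict) -> float:
    # The hidden door is uniform over {0, 1}; the student succeeds when its
    # sampled action matches the hidden door.
    return sum(0.5 * policy[door] for door in (0, 1))

student = fit_student(10_000)
print("student policy:", student)
print("student success rate:", expected_success(student))
print("expert success rate:", 1.0)
```

In this toy case the expert succeeds every time while the imitating student succeeds about half the time. The difference is one concrete instance of an imitation gap: the student copies the expert's action distribution but cannot copy the information that made those actions correct.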

adaptive insubordination, imitation gap, name change



Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.38)