Imitation Learning from Imperfection: Theoretical Justifications and Algorithms

Dec-24-2025, 17:52:02 GMT–Neural Information Processing Systems

Imitation learning (IL) algorithms excel at acquiring high-quality policies from expert data for sequential decision-making tasks. However, their effectiveness is hampered when expert data is limited. To tackle this challenge, a novel framework called (offline) IL with supplementary data has been proposed, which enhances learning by incorporating an additional, imperfect dataset obtained inexpensively from sub-optimal policies. Nonetheless, learning becomes challenging because the supplementary data may include out-of-expert-distribution samples. In this work, we propose a mathematical formalization of this framework and uncover its limitations.

imitation learning, theoretical justification, theoretical justification and algorithm

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.79)