Teaching Inverse Reinforcement Learners via Features and Demonstrations

Luis Haug, Sebastian Tschiatschek, Adish Singla

Feb-12-2026, 18:17:42 GMT–Neural Information Processing Systems

Weintroduceanaturalquantity,the teaching risk, which measures the potential suboptimality of policies that look optimal to the learner in this setting. We show that bounds on the teaching risk guarantee that the learner is able to find a near-optimal policy using standard algorithms basedoninversereinforcement learning. Basedonthesefindings, we suggest a teaching scheme in which the expert can decrease the teaching risk by updating the learner's worldview, and thus ultimately enable her to find a near-optimalpolicy.

machine learning, reinforcement learning, teaching risk, (18 more...)

Neural Information Processing Systems

Feb-12-2026, 18:17:42 GMT

Conferences PDF

Add feedback

Country:
- North America
  - United States > Illinois
    - Cook County > Chicago (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Saarland
    - Saarbrücken (0.04)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

Duplicate Docs Excel Report

Title
Teaching Inverse Reinforcement Learners via Features and Demonstrations
Teaching Inverse Reinforcement Learners via Features and Demonstrations

Similar Docs Excel Report more

Title	Similarity	Source
None found