RobustImitationvia MirrorDescentInverseReinforcementLearning

Feb-11-2026, 18:16:17 GMT–Neural Information Processing Systems

Inspired by a first-order optimization method called mirror descent, this paper proposes topredict asequence ofrewardfunctions, which areiterativesolutions for a constrained convex problem. IRL solutions derived by mirror descent are tolerant totheuncertainty incurred bytargetdensity estimation sincetheamount of reward learning is regulated with respect to local geometric constraints.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Feb-11-2026, 18:16:17 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (0.70)

Duplicate Docs Excel Report

Title
Robust Imitation via Mirror Descent Inverse Reinforcement Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found