Inverse Reinforcement Learning with the Average Reward Criterion

Dec-26-2025, 22:42:58 GMT–Neural Information Processing Systems

We study the problem of Inverse Reinforcement Learning (IRL) with an average-reward criterion. The goal is to recover an unknown policy and a reward function when the agent only has samples of states and actions from an experienced agent. Previous IRL methods assume that the expert is trained in a discounted environment, and the discount factor is known.

average reward criterion, inverse reinforcement learning, name change, (6 more...)

Neural Information Processing Systems

Dec-26-2025, 22:42:58 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)