Compatible Reward Inverse Reinforcement Learning

Metelli, Alberto Maria, Pirotta, Matteo, Restelli, Marcello

Feb-14-2020, 09:28:43 GMT–Neural Information Processing Systems

Inverse Reinforcement Learning (IRL) is an effective approach to recover a reward function that explains the behavior of an expert by observing a set of demonstrations. This paper is about a novel model-free IRL approach that, differently from most of the existing IRL algorithms, does not require to specify a function space where to search for the expert's reward function. Leveraging on the fact that the policy gradient needs to be zero for any optimal policy, the algorithm generates a set of basis functions that span the subspace of reward functions that make the policy gradient vanish. Within this subspace, using a second-order criterion, we search for the reward function that penalizes the most a deviation from the expert's policy. After introducing our approach for finite domains, we extend it to continuous ones.

compatible reward inverse reinforcement learning, inverse reinforcement learning, reward function, (3 more...)

Neural Information Processing Systems

Feb-14-2020, 09:28:43 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.44)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)