Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning

Choi, Jaedeug (Korea Advanced Institute of Science and Technology (KAIST)) | Kim, Kee-Eung (Korea Advanced Institute of Science and Technology (KAIST))

Aug-3-2013–AAAI Conferences

Most inverse reinforcement learning (IRL) algorithms assume that the reward function is a linear function of predefined state and action features. However, it is often difficult to specify by hand a set of features under which the true reward function is linear. We propose a Bayesian nonparametric approach to identifying useful composite features for learning the reward function. The composite features are assumed to be logical conjunctions of the predefined atomic features, so that the reward function can be represented as a linear function of the composite features. We show empirically that our approach learns composite features that capture important aspects of the reward function on synthetic domains, and that it predicts taxi drivers’ behaviour with high accuracy on a real GPS trace dataset.
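
The representation described in the abstract, a reward that is linear in composite features, each of which is a logical conjunction of binary atomic features, can be illustrated with a minimal sketch. It shows only the representation, not the Bayesian nonparametric inference that selects the conjunctions. The function names, the atomic feature meanings, the chosen conjunctions, and the weights are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def composite_features(atomic: Sequence[int],
                       conjunctions: Sequence[Sequence[int]]) -> List[int]:
    """Evaluate each composite feature as the AND of selected atomic features.

    `atomic` holds binary (0/1) atomic feature values for one state-action pair.
    Each entry of `conjunctions` lists the atomic feature indices that are
    ANDed together to form one composite feature.
    """
    return [int(all(atomic[i] for i in indices)) for indices in conjunctions]


def linear_reward(atomic: Sequence[int],
                  conjunctions: Sequence[Sequence[int]],
                  weights: Sequence[float]) -> float:
    """Reward as a weighted sum of the composite feature values."""
    phi = composite_features(atomic, conjunctions)
    return sum(w * f for w, f in zip(weights, phi))


# Hypothetical atomic features: [at_goal, carrying_item, on_rough_road]
# Hypothetical composite features: (at_goal AND carrying_item), (on_rough_road)
CONJUNCTIONS = [(0, 1), (2,)]
WEIGHTS = [10.0, -1.0]

if __name__ == "__main__":
    for atomic in ([1, 1, 0], [1, 0, 1], [1, 1, 1]):
        print(atomic, linear_reward(atomic, CONJUNCTIONS, WEIGHTS))
```

In this sketch, reaching the goal pays off only while carrying the item. A reward that is linear in the atomic features alone cannot express that interaction, which is the motivation the abstract gives for constructing composite features.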

Keywords: Bayesian nonparametric feature construction, inverse reinforcement learning



Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
