Risk-Sensitive Generative Adversarial Imitation Learning
Lacotte, Jonathan, Chow, Yinlam, Ghavamzadeh, Mohammad, Pavone, Marco
Yinlam Chow DeepMind We study risk-sensitive imitation learning where the agent's goal is to perform at least as well as the expert in terms of a risk profile. We first formulate our risk-sensitive imitation learning setting. We consider the generative adversarial approach to imitation learning (GAIL) and derive an optimization problem for our formulation, which we call risk-sensitive GAIL (RS-GAIL). We then derive two different versions of our RS-GAIL optimization problem that aim at matching the risk profiles of the agent and the expert w.r.t. Jensen-Shannon (JS) divergence and Wasserstein distance, and develop risk-sensitive generative adversarial imitation learning algorithms based on these optimization problems. We evaluate the performance of our JSbased algorithm and compare it with GAIL and the risk-averse imitation learning (RAIL) algorithm in two Mu-JoCo tasks.
Aug-13-2018