Distributionally Robust Imitation Learning

Neural Information Processing Systems

Imitation learning considers a Markov Decision Process (MDP) setting in which the reward function is not given, but demonstrations from experts are available. Although the goal of imitation learning is to learn a policy that performs nearly as well as the experts on a desired task, the assumption that demonstrated behaviors are consistently optimal is often violated in practice.