Error Bounds of Imitating Policies and Environments

Neural Information Processing Systems 

Imitation learning trains a policy by mimicking expert demonstrations. V arious imitation methods were proposed and empirically evaluated, meanwhile, their theoretical understanding needs further studies. In this paper, we firstly analyze the value gap between the expert policy and imitated policies by two imitation methods, behavioral cloning and generative adversarial imitation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found