Multi-Agent Generative Adversarial Imitation Learning

Jiaming Song, Hongyu Ren, Dorsa Sadigh, Stefano Ermon

Neural Information Processing Systems

Imitation learning algorithms can be used to learn a policy from expert demonstrations without access to a reward signal.