Wasserstein Learning of Deep Generative Point Process Models