Conditional Distribution Compression via the Kernel Conditional Mean Embedding

Neural Information Processing Systems 

Existing distribution compression methods, such as Kernel Herding (KH), were originally developed for unlabelled data.