Differentially Private Distributed Data Summarization under Covariate Shift

Sarpatwar, Kanthi, Shanmugam, Karthikeyan, Ganapavarapu, Venkata Sitaramagiridharganesh, Jagmohan, Ashish, Vaculin, Roman

Neural Information Processing Systems 

We use the popular Maximum Mean Discrepancy as the measure of statistical distance. The non-private problem has received considerable attention in prior art, for example in prototype selection (Kim et al., NIPS 2016). Our work is the first to obtain strong differential privacy guarantees while ensuring the quality guarantees of the non-private version. We study this problem in a Parsimonious Curator Privacy Model, where a trusted curator coordinates the summarization process while minimizing the amount of private information accessed.