PILLAR: How to make semi-private learning more effective
Pinto, Francesco, Hu, Yaxi, Yang, Fanny, Sanyal, Amartya
–arXiv.org Artificial Intelligence
In Semi-Supervised Semi-Private (SP) learning, the learner has access to both public unlabelled and private labelled data. We propose a computationally efficient algorithm that, under mild assumptions on the data, provably achieves significantly lower private labelled sample complexity and can be efficiently run on real-world datasets. For this purpose, we leverage the features extracted by networks pre-trained on public (labelled or unlabelled) data, whose distribution can significantly differ from the one on which SP learning is performed. To validate its empirical effectiveness, we propose a wide variety of experiments under tight privacy constraints ($\epsilon = 0.1$) and with a focus on low-data regimes. In all of these settings, our algorithm exhibits significantly improved performance over available baselines that use similar amounts of public data.
arXiv.org Artificial Intelligence
Jun-6-2023
- Country:
- Europe > Switzerland (0.28)
- North America > United States (0.28)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (1.00)
- Technology: