Privacy-preserving data release leveraging optimal transport and particle gradient descent
Donhauser, Konstantin, Abad, Javier, Hulkund, Neha, Yang, Fanny
–arXiv.org Artificial Intelligence
We present a novel approach for differentially private data synthesis of protected tabular datasets, a relevant task in highly sensitive domains such as healthcare and government. Current state-of-the-art methods predominantly use marginal-based approaches, where a dataset is generated from private estimates of the marginals. In this paper, we introduce PrivPGD, a new generation method for marginal-based private data synthesis, leveraging tools from optimal transport and particle gradient descent. Our algorithm outperforms existing methods on a large range of datasets while being highly scalable and offering the flexibility to incorporate additional domain-specific constraints.
arXiv.org Artificial Intelligence
Feb-12-2024
- Country:
- North America > United States
- California (0.04)
- New York (0.04)
- Europe
- France (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- North America > United States
- Genre:
- Research Report > Promising Solution (0.68)
- Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.87)
- Transportation > Ground
- Road (0.46)
- Government > Regional Government
- Technology: