Creating a Public Repository for Joining Private Data
–Neural Information Processing Systems
How can one publish a dataset with sensitive attributes in a way that both preserves privacy and enables joins with other datasets on those same sensitive attributes? This problem arises in many contexts, e.g., a hospital and an airline may want to jointly determine whether people who take long-haul flights are more likely to catch respiratory infections. If they join their data by a common keyed user identifier such as email address, they can determine the answer, though it breaks privacy. This paper shows how the hospital can generate a private sketch and how the airline can privately join with the hospital's sketch by email address. The proposed solution satisfies pure differential privacy and gives approximate answers to linear queries and optimization problems over those joins.
Neural Information Processing Systems
Jan-20-2025, 00:47:00 GMT
- Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.40)
- Transportation > Air (0.63)
- Technology: