Non-parametric Differentially Private Confidence Intervals for the Median
Drechsler, Joerg; Globus-Harris, Ira; McMillan, Audra; Sarathy, Jayshree; Smith, Adam
Differential privacy is a restriction on data processing algorithms that provides strong confidentiality guarantees for individual records in the data. However, research on proper statistical inference under differential privacy, that is, on correctly quantifying the uncertainty of the (noisy) sample estimate about the true population value, is still limited. This paper proposes and evaluates several strategies for computing valid differentially private confidence intervals for the median. Instead of computing a differentially private point estimate and then deriving its uncertainty, we estimate the interval bounds directly and discuss why this approach is superior when ensuring privacy is important. We also illustrate that addressing both sources of uncertainty (the error from sampling and the error from protecting the output) simultaneously should be preferred over simpler approaches that incorporate the uncertainty sequentially. We evaluate the performance of the different algorithms under various parameter settings in extensive simulation studies and demonstrate how the findings could be applied in practice using data from the 1940 Decennial Census.
Jul-3-2021
- Country:
  - Europe > Germany (0.04)
  - North America
    - United States
      - Utah (0.04)
      - Nevada (0.04)
      - Arizona (0.04)
      - New Mexico (0.04)
      - Idaho (0.04)
      - Colorado (0.04)
      - Wyoming (0.04)
      - Montana (0.04)
      - Maryland (0.04)
      - Pennsylvania (0.04)
      - District of Columbia > Washington (0.04)
      - Rhode Island > Providence County
        - Providence (0.04)
      - Virginia > Arlington County
        - Arlington (0.04)
      - New Jersey > Mercer County
        - Princeton (0.04)
      - California
        - San Diego County > San Diego (0.04)
        - Monterey County > Monterey (0.04)
      - New York > New York County
        - New York City (0.14)
    - Canada > British Columbia
  - United States
- Genre:
  - Overview (0.92)
  - Research Report
    - Experimental Study (1.00)
    - New Finding (0.92)