Differential Privacy with Random Projections and Sign Random Projections
–arXiv.org Artificial Intelligence
In this paper, we develop a series of differential privacy (DP) algorithms from a family of random projections (RP) for general applications in machine learning, data mining, and information retrieval. Among the presented algorithms, iDP-SignRP is remarkably effective under the setting of ``individual differential privacy'' (iDP), based on sign random projections (SignRP). Also, DP-SignOPORP considerably improves existing algorithms in the literature under the standard DP setting, using ``one permutation + one random projection'' (OPORP), where OPORP is a variant of the celebrated count-sketch method with fixed-length binning and normalization. Without taking signs, among the DP-RP family, DP-OPORP achieves the best performance. Our key idea for improving DP-RP is to take only the signs, i.e., $sign(x_j) = sign\left(\sum_{i=1}^p u_i w_{ij}\right)$, of the projected data. The intuition is that the signs often remain unchanged when the original data ($u$) exhibit small changes (according to the ``neighbor'' definition in DP). In other words, the aggregation and quantization operations themselves provide good privacy protections. We develop a technique called ``smooth flipping probability'' that incorporates this intuitive privacy benefit of SignRPs and improves the standard DP bit flipping strategy. Based on this technique, we propose DP-SignOPORP which satisfies strict DP and outperforms other DP variants based on SignRP (and RP), especially when $\epsilon$ is not very large (e.g., $\epsilon = 5\sim10$). Moreover, if an application scenario accepts individual DP, then we immediately obtain an algorithm named iDP-SignRP which achieves excellent utilities even at small~$\epsilon$ (e.g., $\epsilon<0.5$).
arXiv.org Artificial Intelligence
Jun-13-2023
- Country:
- Africa > Rwanda
- Asia
- Afghanistan > Parwan Province
- Charikar (0.04)
- China > Beijing
- Beijing (0.04)
- Japan > Honshū
- Kantō
- Kanagawa Prefecture > Yokohama (0.04)
- Tokyo Metropolis Prefecture > Tokyo (0.14)
- Kantō
- Afghanistan > Parwan Province
- Europe
- Austria > Vienna (0.14)
- France > Hauts-de-France
- Iceland > Capital Region
- Reykjavik (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Spain > Canary Islands (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America
- Canada
- British Columbia
- Ontario > Toronto (0.14)
- Quebec > Montreal (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- Santa Barbara County > Santa Barbara (0.04)
- Santa Clara County
- Santa Clara (0.04)
- Stanford (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- District of Columbia > Washington (0.04)
- Washington > King County
- Bellevue (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- New Jersey
- Mercer County > Princeton (0.04)
- Middlesex County > New Brunswick (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Maryland
- Baltimore (0.04)
- Montgomery County > Bethesda (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York
- Kings County > New York City (0.04)
- New York County > New York City (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Texas
- Dallas County > Dallas (0.04)
- Harris County > Houston (0.04)
- Travis County > Austin (0.04)
- Arizona > Maricopa County
- Scottsdale (0.04)
- California
- Canada
- Oceania > Australia
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Performance Analysis > Accuracy (0.67)
- Statistical Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Machine Learning
- Data Science > Data Mining (1.00)
- Security & Privacy (1.00)
- Artificial Intelligence
- Information Technology