Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning
Gao, Fengyu, Zhou, Ruida, Wang, Tianhao, Shen, Cong, Yang, Jing
Large Language Models (LLMs) rely on the contextual information embedded in examples/demonstrations to perform in-context learning (ICL). To mitigate the risk of LLMs potentially leaking private information contained in examples in the prompt, we introduce a novel data-adaptive differentially private algorithm called AdaDPSyn to generate synthetic examples from the private dataset and then use these synthetic examples to perform ICL. The objective of AdaDPSyn is to adaptively adjust the noise level in the data synthesis mechanism according to the inherent statistical properties of the data, thereby preserving high ICL accuracy while maintaining formal differential privacy guarantees. A key innovation in AdaDPSyn is the Precision-Focused Iterative Radius Reduction technique, which dynamically refines the aggregation radius - the scope of data grouping for noise addition - based on patterns observed in data clustering, thereby minimizing the amount of additive noise. We conduct extensive experiments on standard benchmarks and compare AdaDPSyn with DP few-shot generation algorithm (Tang et al., 2023). The experiments demonstrate that AdaDPSyn not only outperforms DP few-shot generation, but also maintains high accuracy levels close to those of non-private baselines, providing an effective solution for ICL with privacy protection.
Oct-15-2024
- Country:
- Oceania > Australia
- New South Wales (0.04)
- South Australia (0.04)
- North America
- Canada > Prince Edward Island (0.04)
- United States
- Virginia (0.14)
- Pennsylvania (0.04)
- Maryland (0.04)
- New York > New York County
- New York City (0.04)
- Kentucky > Jefferson County
- Louisville (0.04)
- Indiana > Monroe County
- Bloomington (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Los Angeles (0.14)
- Alameda County > Berkeley (0.04)
- Europe
- France (0.04)
- Russia > Northwestern Federal District
- Leningrad Oblast > Saint Petersburg (0.04)
- Atlantic Ocean > North Atlantic Ocean
- Chesapeake Bay (0.04)
- Asia
- Africa > Kenya
- North Eastern Province (0.04)
- Eastern Province (0.04)
- Oceania > Australia
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Media > Film (1.00)
- Information Technology > Security & Privacy (1.00)
- Leisure & Entertainment > Sports
- Cricket (0.92)
- Government > Regional Government
- Technology: