Private Training Large-scale Models with Efficient DP-SGD
–Neural Information Processing Systems
As large language models (LLMs) increasingly underpin technological advancements, the privacy of their training data emerges as a critical concern. Differential Privacy (DP) serves as a rigorous mechanism to protect this data, yet its integration via Differentially Private Stochastic Gradient Descent (DP-SGD) introduces substantial challenges, primarily due to the complexities of per-sample gradient clipping.
Neural Information Processing Systems
Jun-11-2026, 07:41:45 GMT
- Technology: