Understanding Gradient Clipping in Private SGD: A Geometric Perspective

Oct-10-2024, 22:51:18 GMT–Neural Information Processing Systems

Deep learning models are increasingly popular in many machine learning applications where the training data may contain sensitive information. To provide formal and rigorous privacy guarantees, many learning systems now incorporate differential privacy by training their models with (differentially) private SGD. A key step in each private SGD update is gradient clipping, which shrinks the gradient of an individual example whenever its ℓ2 norm exceeds a certain threshold. We first demonstrate how gradient clipping can prevent SGD from converging to a stationary point. We then provide a theoretical analysis of private SGD with gradient clipping.
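To make the clipping step concrete, here is a minimal sketch of one private SGD update in plain Python. It is an illustration under stated assumptions, not the paper's implementation: the names `clip_gradient`, `private_sgd_step`, `clip_norm`, and `noise_multiplier` are hypothetical, the noise is Gaussian with standard deviation `noise_multiplier * clip_norm`, and the toy gradients at the end are invented for this example rather than taken from the paper.

```python
import math
import random


def clip_gradient(grad, clip_norm):
    """Shrink grad so that its l2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    # Gradients already inside the ball of radius clip_norm are left unchanged.
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def private_sgd_step(params, per_example_grads, lr, clip_norm,
                     noise_multiplier, rng):
    """One update: clip each example's gradient, sum, add noise, average."""
    n = len(per_example_grads)
    dim = len(params)
    clipped = [clip_gradient(g, clip_norm) for g in per_example_grads]
    summed = [sum(g[j] for g in clipped) for j in range(dim)]
    # Assumption: Gaussian noise scaled to the clipping threshold.
    sigma = noise_multiplier * clip_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    return [p - lr * v / n for p, v in zip(params, noisy)]


# Toy 1-D example (invented): the raw per-example gradients average to
# zero, so the current point is stationary for the unclipped objective.
grads = [[-1.0], [-1.0], [-1.0], [3.0]]
mean_raw = sum(g[0] for g in grads) / len(grads)
mean_clipped = sum(clip_gradient(g, 1.0)[0] for g in grads) / len(grads)

# With the noise switched off, the clipped update still moves the
# parameter away from the stationary point.
new_params = private_sgd_step([0.0], grads, lr=1.0, clip_norm=1.0,
                              noise_multiplier=0.0, rng=random.Random(0))
```

In this toy case the single large gradient is shrunk from 3 to 1 while the three small ones are untouched, so the clipped average becomes nonzero even though the true average is zero. This asymmetric setup is one simple way to see how clipping can keep SGD from settling at a stationary point.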

geometric perspective, gradient, gradient distribution

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)