Differentially Private Decoding in Large Language Models

Majmudar, Jimit, Dupuy, Christophe, Peris, Charith, Smaili, Sami, Gupta, Rahul, Zemel, Richard

Sep-8-2022–arXiv.org Artificial Intelligence

Recent large-scale natural language processing (NLP) systems use a pre-trained Large Language Model (LLM) on massive and diverse corpora as a headstart. In practice, the pre-trained model is adapted to a wide array of tasks via fine-tuning on task-specific datasets. LLMs, while effective, have been shown to memorize instances of training data thereby potentially revealing private information processed during pre-training. The potential leakage might further propagate to the downstream tasks for which LLMs are fine-tuned. On the other hand, privacy-preserving algorithms usually involve retraining from scratch, which is prohibitively expensive for LLMs. In this work, we propose a simple, easy to interpret, and computationally lightweight perturbation mechanism to be applied to an already trained model at the decoding stage. Our perturbation mechanism is model-agnostic and can be used in conjunction with any LLM. We provide theoretical analysis showing that the proposed mechanism is differentially private, and experimental results showing a privacy-utility trade-off.

mechanism, memorization, predicted line, (13 more...)

arXiv.org Artificial Intelligence

Sep-8-2022

arXiv.org PDF

Add feedback

Country:
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Information Technology > Security & Privacy (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found