Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Mar-19-2025, 07:14:10 GMT–Neural Information Processing Systems

To mitigate memorization, we introduce a subtle modification to the next-token training objective that we call the goldfish loss. During training, a randomly sampled subsets of tokens are excluded from the loss computation. These dropped tokens are not memorized by the model, which prevents verbatim reproduction of a complete chain of tokens from the training set. We run extensive experiments training billion-scale LLaMA-2 models, both pre-trained and trained from scratch, and demonstrate significant reductions in extractable memorization with little to no impact on downstream benchmarks.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Mar-19-2025, 07:14:10 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (1.00)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Government > Regional Government
  - North America Government > United States Government (0.46)
- Information Technology > Security & Privacy (0.46)
- Law (0.93)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Memory-Based Learning > Rote Learning (0.85)
    - Neural Networks > Deep Learning (0.66)
    - Performance Analysis > Accuracy (0.94)
  - Natural Language > Large Language Model (1.00)