D ej ` a vu Memorization in Vision-Language Models

Oct-10-2025, 03:34:55 GMT–Neural Information Processing Systems

Vision-Language Models (VLMs) have emerged as the state-of-the-art representation learning solution, with myriads of downstream applications such as image classification, retrieval and generation. A natural question is whether these models memorize their training data, which also has implications for generalization. We propose a new method for measuring memorization in VLMs, which we call d ej ` a vu memorization . For VLMs trained on image-caption pairs, we show that the model indeed retains information about individual objects in the training images beyond what can be inferred from correlations or the image caption. We evaluate d ej ` a vu memorization at both sample and population level, and show that it is significant for OpenCLIP trained on as many as 50M image-caption pairs. Finally, we show that text randomization considerably mitigates memorization while only moderately impacting the model's downstream task performance.

information, memorization, vu memorization, (15 more...)

Neural Information Processing Systems

Oct-10-2025, 03:34:55 GMT

Conferences PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - California (0.04)
- Europe
  - Poland (0.04)
  - Switzerland > Zürich
    - Zürich (0.14)
- Africa > Central African Republic
  - Ombella-M'Poko > Bimbo (0.04)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Machine Learning > Memory-Based Learning
    - Rote Learning (1.00)

Duplicate Docs Excel Report

Title
5ab6f836f464d0f4e4f6aaa523249280-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found