Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
Ju, Tianjie, Hua, Yi, Fei, Hao, Shao, Zhenyu, Zheng, Yubin, Zhao, Haodong, Lee, Mong-Li, Hsu, Wynne, Zhang, Zhuosheng, Liu, Gongshen
–arXiv.org Artificial Intelligence
Multi-Modal Large Language Models (MLLMs) have exhibited remarkable performance on various vision-language tasks such as Visual Question Answering (VQA). Despite accumulating evidence of privacy concerns associated with task-relevant content, it remains unclear whether MLLMs inadvertently memorize private content that is entirely irrelevant to the training tasks. In this paper, we investigate how randomly generated task-irrelevant private content can become spuriously correlated with downstream objectives due to partial mini-batch training dynamics, thus causing inadvertent memorization. Concretely, we randomly generate task-irrelevant watermarks into VQA fine-tuning images at varying probabilities and propose a novel probing framework to determine whether MLLMs have inadvertently encoded such content. Our experiments reveal that MLLMs exhibit notably different training behaviors in partial mini-batch settings with task-irrelevant watermarks embedded. Furthermore, through layer-wise probing, we demonstrate that MLLMs trigger distinct representational patterns when encountering previously seen task-irrelevant knowledge, even if this knowledge does not influence their output during prompting. Our code is available at https://github.com/illusionhi/ProbingPrivacy.
arXiv.org Artificial Intelligence
Mar-3-2025
- Country:
- Oceania > Australia (0.04)
- North America
- United States
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Washington > King County
- Canada > Ontario
- Toronto (0.04)
- United States
- Europe
- Austria > Vienna (0.14)
- Czechia > Prague (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Italy > Piedmont
- Turin Province > Turin (0.04)
- France > Île-de-France
- Asia
- Africa > Rwanda
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: