Towards Benign Memory Forgetting for Selective Multimodal Large Language Model Unlearning

Zeng, Zhen, Gu, Leijiang, Duan, Zhangling, Li, Feng, Shi, Zenglin, Snoek, Cees G. M., Wang, Meng

Nov-26-2025–arXiv.org Artificial Intelligence

Multimodal Large Language Models (MLLMs) achieve remarkable capabilities but can inadvertently memorize privacy-sensitive information. Although existing unlearning methods can remove such knowledge, they fail to achieve benign forgetting because they often degrade the model's general image understanding performance. T o address this, we propose the Sculpted Memory F orget-ting Adapter (SMF A), which confines forgetting to targeted memory regions while preserving overall capabilities. SMF A first fine-tunes the model to replace sensitive responses with refusals, yielding a memory forgetting adapter, and then applies a retaining anchor-guided masking mechanism to prevent interference with unrelated knowledge and understanding ability. T o systematically evaluate selective MLLM unlearning, we introduce S-MLLMUn Bench, the first benchmark designed to jointly assess the removal of sensitive knowledge and retention of general visual understanding. Extensive experiments show that, unlike prior methods, SMF A achieves precise and controllable unlearning while maintaining the model's foundational image understanding.

artificial intelligence, large language model, natural language, (15 more...)

arXiv.org Artificial Intelligence

Nov-26-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.82)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found