Elastic Robust Unlearning of Specific Knowledge in Large Language Models

Jun-10-2026, 04:39:50 GMT–Neural Information Processing Systems

LLM unlearning aims to remove sensitive or harmful information from a model, thereby reducing the risk that it generates unintended content. However, existing Preference Optimization (PO)-based unlearning methods suffer from two limitations. First, their rigid reward setting limits the effectiveness of unlearning.

artificial intelligence, large language model, natural language

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)