Wisdom is Knowing What not to Say Hallucination Free LLMs Unlearning via Attention Shifting
–Neural Information Processing Systems
The increase in computing power and the necessity of AI-assisted decision-making boost the growing application of Large Language Models (LLMs). Along with this, the potential retention of sensitive data of LLMs has spurred increasing research into machine unlearning. However, existing unlearning approaches face a critical dilemma: Aggressive unlearning compromises model utility, while conservative strategies preserve utility but risk hallucinated responses. This significantly limits LLMs' reliability in knowledge-intensive applications. To address this, we introduce a novel Attention-Shifting (AS) framework for selective unlearning.
Neural Information Processing Systems
Jun-19-2026, 17:13:35 GMT
- Genre:
- Overview (0.67)
- Research Report
- Experimental Study (1.00)
- New Finding (0.67)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: