Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge
