Prompt Attacks Reveal Superficial Knowledge Removal in Unlearning Methods

Open in new window