Large Language Model Unlearning via Embedding-Corrupted Prompts

Neural Information Processing Systems 

Instead of relying on the LLM itself to unlearn, we enforce an unlearned state during inference by employing a prompt classifier to identify and safeguard prompts to forget.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found