Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing
