Model-Editing-Based Jailbreak against Safety-aligned Large Language Models
