Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation

Open in new window