Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation

Open in new window