LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Open in new window