Do Methods to Jailbreak and Defend LLMs Generalize Across Languages?
