Na'vi or Knave: Jailbreaking Language Models via Metaphorical Avatars

Open in new window