Information-Theoretical Principled Trade-off between Jailbreakability and Stealthiness on Vision Language Models
