Debate with Images: Detecting Deceptive Behaviors in Multimodal Large Language Models
