CHAI: Command Hijacking against embodied AI
Burbano, Luis, Ortiz, Diego, Sun, Qi, Yang, Siwei, Tu, Haoqin, Xie, Cihang, Cao, Yinzhi, Cardenas, Alvaro A
–arXiv.org Artificial Intelligence
Embodied Artificial Intelligence (AI) promises to handle edge cases in robotic vehicle systems where data is scarce by using common-sense reasoning grounded in perception and action to generalize beyond training distributions and adapt to novel real-world situations. These capabilities, however, also create new security risks. In this paper, we introduce CHAI (Command Hijacking against embodied AI), a new class of prompt-based attacks that exploit the multimodal language interpretation abilities of Large Visual-Language Models (LVLMs). CHAI embeds deceptive natural language instructions, such as misleading signs, in visual input, systematically searches the token space, builds a dictionary of prompts, and guides an attacker model to generate Visual Attack Prompts. We evaluate CHAI on four LVLM agents; drone emergency landing, autonomous driving, and aerial object tracking, and on a real robotic vehicle. Our experiments show that CHAI consistently outperforms state-of-the-art attacks. By exploiting the semantic and multimodal reasoning strengths of next-generation embodied AI systems, CHAI underscores the urgent need for defenses that extend beyond traditional adversarial robustness.
arXiv.org Artificial Intelligence
Oct-2-2025
- Country:
- Asia > Middle East
- Republic of Türkiye > Karaman Province > Karaman (0.04)
- North America
- Canada > British Columbia
- Vancouver (0.04)
- United States > California
- Orange County > Anaheim (0.04)
- Santa Cruz County > Santa Cruz (0.04)
- Canada > British Columbia
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Industry:
- Technology: