Shadowcast: Stealthy Data Poisoning Attacks against Vision-Language Models
Neural Information Processing Systems
Vision-Language Models (VLMs) excel at generating textual responses from visual inputs, but their versatility raises security concerns. This study takes the first step in exposing VLMs' susceptibility to data poisoning attacks that can manipulate responses to innocuous, everyday prompts. We introduce Shadowcast, a stealthy data poisoning attack in which poison samples are visually indistinguishable from benign images with matching texts. Shadowcast demonstrates effectiveness in two attack types. The first is a traditional Label Attack, tricking VLMs into misidentifying class labels, such as confusing Donald Trump with Joe Biden. The second is a Persuasion Attack, which leverages VLMs' text generation capabilities to craft persuasive yet misleading narratives, such as portraying junk food as healthy.
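To make the "visually indistinguishable poison sample" idea concrete, below is a minimal, hypothetical sketch of a feature-matching perturbation of the kind the abstract describes: a benign-looking image is nudged within a small pixel budget so that its latent features resemble those of an image from another concept. This is not the authors' code; the stand-in torchvision encoder, the function `craft_poison_image`, and the tensors `base` and `target` are illustrative assumptions, while a real attack would target the victim VLM's own image encoder.

```python
# Hypothetical sketch of a feature-matching poison perturbation (not the paper's implementation).
import torch
import torchvision.models as models

# Stand-in vision encoder; Shadowcast-style attacks would use the VLM's image encoder instead.
encoder = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
encoder.fc = torch.nn.Identity()  # expose penultimate features
encoder.eval()

def craft_poison_image(base_img, target_img, eps=8 / 255, steps=100, lr=1e-2):
    """Perturb `base_img` (kept visually unchanged, ||delta||_inf <= eps) so that its
    encoder features move toward those of `target_img` from another concept."""
    delta = torch.zeros_like(base_img, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    with torch.no_grad():
        target_feat = encoder(target_img)
    for _ in range(steps):
        poison = (base_img + delta).clamp(0, 1)
        loss = torch.nn.functional.mse_loss(encoder(poison), target_feat)
        opt.zero_grad()
        loss.backward()
        opt.step()
        with torch.no_grad():
            delta.clamp_(-eps, eps)  # keep the perturbation visually imperceptible
    return (base_img + delta).detach().clamp(0, 1)

# Usage with random tensors standing in for real (N, 3, 224, 224) images:
base = torch.rand(1, 3, 224, 224)    # image the poison should still resemble
target = torch.rand(1, 3, 224, 224)  # image from the concept to mimic in feature space
poison = craft_poison_image(base, target)
```

In a poisoning setting, the resulting image would be paired with text matching its benign appearance and injected into training data; the effectiveness and stealth trade-off depends on the pixel budget `eps` and the encoder being attacked.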