ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training
Yao, Xin, Zhao, Haiyang, Chen, Yimin, Guo, Jiawei, Huang, Kecheng, Zhao, Ming
–arXiv.org Artificial Intelligence
The Contrastive Language-Image Pretraining (CLIP) model has significantly advanced vision-language modeling by aligning image-text pairs from large-scale web data through self-supervised contrastive learning. Yet, its reliance on uncurated Internet-sourced data exposes it to data poisoning and backdoor risks. While existing studies primarily investigate image-based attacks, the text modality, which is equally central to CLIP's training, remains underexplored. In this work, we introduce ToxicTextCLIP, a framework for generating high-quality adversarial texts that target CLIP during the pre-training phase. The framework addresses two key challenges: semantic misalignment caused by background inconsistency with the target class, and the scarcity of background-consistent texts. To this end, ToxicTextCLIP iteratively applies: 1) a background-aware selector that prioritizes texts with background content aligned to the target class, and 2) a background-driven augmenter that generates semantically coherent and diverse poisoned samples. Extensive experiments on classification and retrieval tasks show that ToxicTextCLIP achieves up to 95.83% poisoning success and 98.68% backdoor Hit@1, while bypassing RoCLIP, CleanCLIP and SafeCLIP defenses. The source code can be accessed via https://github.com/xinyaocse/ToxicTextCLIP/.
arXiv.org Artificial Intelligence
Nov-4-2025
- Country:
- Asia
- China > Hunan Province (0.04)
- Middle East
- Israel > Tel Aviv District
- Tel Aviv (0.04)
- Jordan (0.04)
- Israel > Tel Aviv District
- Europe
- North America
- Canada > Ontario
- National Capital Region > Ottawa (0.04)
- Dominican Republic (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.14)
- Florida > Miami-Dade County
- Miami (0.04)
- Hawaii (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Baltimore (0.04)
- Massachusetts > Middlesex County
- Lowell (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California
- Canada > Ontario
- Oceania > Australia
- Asia
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: