NCL: Textual Backdoor Defense Using Noise-augmented Contrastive Learning

Zhai, Shengfang, Shen, Qingni, Chen, Xiaoyi, Wang, Weilong, Li, Cong, Fang, Yuejian, Wu, Zhonghai

Mar-3-2023–arXiv.org Artificial Intelligence

Backdoor attacks have attracted attention because of the serious harm they do to deep learning models. The adversary poisons the training data so that a backdoor is injected into the model when victims unknowingly train it on the poisoned dataset. In the text domain, however, existing works do not provide sufficient defense against backdoor attacks. In this paper, we propose a Noise-augmented Contrastive Learning (NCL) framework to defend against textual backdoor attacks when training models with untrustworthy data. To weaken the mapping between triggers and the target label, we add appropriate noise to perturb possible backdoor triggers, augment the training dataset, and then pull homologous samples together in the feature space using a contrastive learning objective. Experiments demonstrate the effectiveness of our method in defending against three types of textual backdoor attacks, outperforming prior works.
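The pipeline described in the abstract (perturb possible triggers with noise, augment the dataset, pull homologous samples together) can be illustrated with a minimal sketch. The abstract does not specify the noise or the loss, so everything concrete here is an assumption: the noise is random token masking, the objective is an InfoNCE-style loss over cosine similarity, and the names `add_noise`, `augment`, and `contrastive_loss` are hypothetical rather than taken from the paper.

```python
import math
import random


def add_noise(tokens, rate=0.3, mask="[MASK]", rng=None):
    """Replace each token with `mask` independently with probability `rate`.

    Assumption: token masking stands in for the paper's (unspecified) noise.
    """
    if rng is None:
        rng = random.Random()
    return [mask if rng.random() < rate else t for t in tokens]


def augment(dataset, n_views=2, rate=0.3, seed=0):
    """Expand each (tokens, label) sample into itself plus `n_views` noisy copies.

    Each output row is (tokens, label, source_id). Rows that share a
    source_id come from the same original sample and are treated as
    homologous.
    """
    rng = random.Random(seed)
    out = []
    for source_id, (tokens, label) in enumerate(dataset):
        out.append((list(tokens), label, source_id))
        for _ in range(n_views):
            out.append((add_noise(tokens, rate, rng=rng), label, source_id))
    return out


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0.0 or nv == 0.0:
        return 0.0
    return dot / (nu * nv)


def contrastive_loss(features, source_ids, temperature=0.5):
    """InfoNCE-style loss averaged over all (anchor, positive) pairs.

    Positives for an anchor are the other rows with the same source_id;
    every other row in the batch appears in the denominator.
    """
    n = len(features)
    total, count = 0.0, 0
    for i in range(n):
        sims = [
            math.exp(cosine(features[i], features[j]) / temperature) if j != i else 0.0
            for j in range(n)
        ]
        denom = sum(sims)
        for j in range(n):
            if j != i and source_ids[j] == source_ids[i]:
                total += -math.log(sims[j] / denom)
                count += 1
    return total / count if count else 0.0
```

In a real training loop the feature vectors would come from the text encoder being trained, and this loss would presumably be combined with the usual classification loss; how the two are weighted is not stated in the abstract.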

artificial intelligence, backdoor attack, machine learning


Genre:
- Research Report (0.40)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)
