BadPrompt: Backdoor Attacks on Continuous Prompts

Dec-25-2025, 16:38:06 GMT–Neural Information Processing Systems

The prompt-based learning paradigm has gained much research attention recently. It has achieved state-of-the-art performance on several NLP tasks, especially in the few-shot scenarios. While steering the downstream tasks, few works have been reported to investigate the security problems of the prompt-based models. In this paper, we conduct the first study on the vulnerability of the continuous prompt learning algorithm to backdoor attacks. We observe that the few-shot scenarios have posed a great challenge to backdoor attacks on the prompt-based models, limiting the usability of existing NLP backdoor methods.

backdoor attack, badprompt, name change, (6 more...)

Neural Information Processing Systems

Dec-25-2025, 16:38:06 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Security & Privacy (0.89)
  - Artificial Intelligence > Machine Learning (0.77)