Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots Ruixiang T ang
–Neural Information Processing Systems
In the field of natural language processing, the prevalent approach involves fine-tuning pretrained language models (PLMs) using local samples.
Neural Information Processing Systems
Feb-17-2026, 17:25:28 GMT
- Country:
- Asia
- China > Zhejiang Province
- Hangzhou (0.04)
- Nepal (0.04)
- China > Zhejiang Province
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Information Technology > Security & Privacy (0.73)
- Technology: