Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
Neural Information Processing Systems
However, does achieving a low ASR through current safety purification methods truly eliminate learned backdoor features from the pretraining phase? In this paper, we provide an affirmative answer to this question by thoroughly investigating the Post-Purification Robustness of current backdoor purification methods.
Feb-16-2026, 13:52:43 GMT
- Country:
- Asia > China
- Hong Kong (0.04)
- Europe > Latvia
- Lubāna Municipality > Lubāna (0.04)
- North America > United States
- Pennsylvania (0.04)
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: