Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense
–Neural Information Processing Systems
However, Does achieving a low ASR through current safety purification methods truly eliminate learned backdoor features from the pretraining phase? In this paper, we provide an affirmative answer to this question by thoroughly investigating the Post-Purification Robustness of current backdoor purification methods.
Neural Information Processing Systems
Feb-16-2026, 13:52:43 GMT
- Country:
- North America > United States
- Pennsylvania (0.04)
- Europe > Latvia
- Lubāna Municipality > Lubāna (0.04)
- Asia > China
- Hong Kong (0.04)
- North America > United States
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.93)
- Research Report
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: