Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense

Feb-16-2026, 13:52:43 GMT–Neural Information Processing Systems

However, Does achieving a low ASR through current safety purification methods truly eliminate learned backdoor features from the pretraining phase? In this paper, we provide an affirmative answer to this question by thoroughly investigating the Post-Purification Robustness of current backdoor purification methods.

data mining, machine learning, purified model, (20 more...)

Neural Information Processing Systems

Feb-16-2026, 13:52:43 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania (0.04)
- Europe > Latvia
  - Lubāna Municipality > Lubāna (0.04)
- Asia > China
  - Hong Kong (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.93)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Data Science > Data Mining (0.67)
  - Artificial Intelligence
    - Natural Language (0.68)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
8e8399e5e7aed601c9f135f40be26564-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found