SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification

Mar-22-2026, 16:38:24 GMT–Neural Information Processing Systems

The advancement of Machine Learning has enabled the widespread deployment of Machine Learning as a Service (MLaaS) applications. However, the untrustworthy nature of third-party ML services poses backdoor threats. Existing defenses in MLaaS are limited by their reliance on training samples or white-box model analysis, highlighting the need for a black-box backdoor purification method. In our paper, we attempt to use diffusion models for purification by introducing noise in a forward diffusion process to destroy backdoors and recover clean samples through a reverse generative process. However, since a higher noise also destroys the semantics of the original samples, it still results in a low restoration performance.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Mar-22-2026, 16:38:24 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.98)