Class-DisentanglementandApplicationsin AdversarialDetectionandDefense

Feb-9-2026, 16:37:21 GMT–Neural Information Processing Systems

What is the minimum necessary information required by a neural netD() from an image x to accurately predict its class? Extracting such information in the input space fromx can allocate the areasD() mainly attending to and shed novel insights to the detection and defense of adversarial attacks. In this paper, we propose "class-disentanglement" that trains a variational autoencoder G() to extract this class-dependent information asx G(x) via a trade-off between reconstructingx by G(x) and classifying x by D(x G(x)), where the former competes with the latter in decomposingx so the latter retains only necessary information for classification inx G(x).

artificial intelligence, arxivpreprintarxiv, machine learning, (17 more...)

Neural Information Processing Systems

Feb-9-2026, 16:37:21 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Performance Analysis > Accuracy (0.93)
  - Neural Networks (0.90)

Duplicate Docs Excel Report

Title
8606f35ec6c77858dfb80a385d0d1151-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found