Supplementary Material: Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training

Feb-9-2026, 17:15:19 GMT–Neural Information Processing Systems

The initial learning rate is set to 0.1.

A.2 Adversarial Training

Unless otherwise specified, we perform adversarial training to train robust classifiers by following Madry et al. [74]. Specifically, we train against a projected gradient descent (PGD) adversary, starting from a random initial perturbation of the training data. Unless otherwise specified, we use the values of ε provided in Table 5 to train our models. We use 7 steps of PGD with a step size of ε/5.

A.3 Delusive Adversaries

Six delusive attacks are considered to validate our proposed defense.
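
As a rough illustration of the PGD adversary described in A.2 (random start, 7 steps, step size equal to one fifth of the perturbation budget ε), the sketch below attacks a toy logistic-regression model in plain Python. The ℓ∞ threat model, the linear model, and all function names (`logistic_loss`, `loss_grad_x`, `pgd_linf`) are assumptions made for this illustration; they are not taken from the paper, whose models are deep networks and whose ε values are given in Table 5.

```python
import math
import random


def logistic_loss(w, b, x, y):
    """Logistic loss of a linear model on one example, label y in {0, 1}."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return math.log(1.0 + math.exp(-z)) if y == 1 else math.log(1.0 + math.exp(z))


def loss_grad_x(w, b, x, y):
    """Analytic gradient of the logistic loss with respect to the input x."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-z))
    return [(p - y) * wi for wi in w]


def sign(v):
    return (v > 0) - (v < 0)


def pgd_linf(w, b, x, y, eps, steps=7, rng=None):
    """PGD under an assumed l-infinity budget eps, with step size eps / 5."""
    rng = rng or random.Random(0)
    alpha = eps / 5.0
    # Random initial perturbation inside the eps-ball around x.
    x_adv = [xi + rng.uniform(-eps, eps) for xi in x]
    for _ in range(steps):
        g = loss_grad_x(w, b, x_adv, y)
        # Ascend the loss along the sign of the input gradient.
        x_adv = [xa + alpha * sign(gi) for xa, gi in zip(x_adv, g)]
        # Project back onto the eps-ball around the clean input.
        x_adv = [min(max(xa, xi - eps), xi + eps) for xa, xi in zip(x_adv, x)]
    return x_adv


# Hypothetical toy values, chosen only to exercise the function.
w, b = [1.0, -2.0], 0.1
x, y, eps = [0.5, 0.5], 1, 0.1
x_adv = pgd_linf(w, b, x, y, eps)
```

In adversarial training, each minibatch would be replaced by such perturbed examples before the usual gradient update on the model parameters. This sketch omits that outer loop and any clipping to a valid input range.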


