fb4c48608ce8825b558ccf07169a3421-Supplemental.pdf

Apr-27-2026, 22:53:20 GMT–Neural Information Processing Systems

In this section, we perform additional diagnostics that give us confidence that our models are not doing any form of gradient obfuscation or masking [3, 53]. First, we report in Table 4 the robust accuracy obtained by our strongest models against a diverse set of attacks. The cascade is composed as follows: AUTOPGD-CE, an untargeted attack using PGD with an adaptive step on the cross-entropy loss [10], AUTOPGD-T, a targeted attack using PGD with an adaptive step on the difference of logits ratio [10], FAB-T, a targeted attack which minimizes the norm of adversarial perturbations [9], SQUARE, a query-efficient black-box attack [1]. First, we observe that our combination of attacks, denoted AA+MT matches the final robust accuracy measured by AUTOATTACK. Second, we also notice that the black-box attack (i.e., SQUARE) does not find any additional adversarial examples.

accuracy, artificial intelligence, robust accuracy, (17 more...)

Neural Information Processing Systems

Apr-27-2026, 22:53:20 GMT

Conferences PDF

Add feedback

Industry:
- Transportation > Air (0.55)

Technology:
- Information Technology > Artificial Intelligence (0.70)

Duplicate Docs Excel Report

Title
fb4c48608ce8825b558ccf07169a3421-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found