GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models

May-29-2025, 08:33:20 GMT–Neural Information Processing Systems

Current studies on adversarial robustness mainly focus on aggregating local robustness results from a set of data samples to evaluate and rank different models. However, the local statistics may not well represent the true global robustness of the underlying unknown data distribution. To address this challenge, this paper makes the first attempt to present a new framework, called GREAT Score, for global robustness evaluation of adversarial perturbation using generative models. Formally, GREAT Score carries the physical meaning of a global statistic capturing a mean certified attack-proof perturbation level over all samples drawn from a generative model. For finite-sample evaluation, we also derive a probabilistic guarantee on the sample complexity and the difference between the sample mean and the true mean. GREAT Score has several advantages: (1) Robustness evaluations using GREAT Score are efficient and scalable to large models, by sparing the need of running adversarial attacks. In particular, we show high correlation and significantly reduced computation cost of GREAT Score when compared to the attack-based model ranking on RobustBench [12].

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

May-29-2025, 08:33:20 GMT

Conferences PDF

Add feedback

Country:
- Asia > China
  - Hong Kong (0.14)
- North America > United States (0.14)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (0.67)

Industry:
- Information Technology > Security & Privacy (0.88)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)
  - Natural Language > Generation (0.83)
  - Representation & Reasoning (1.00)
  - Vision (1.00)