MORA: Improving Ensemble Robustness Evaluation with Model Reweighing Attack

Jan-18-2025, 12:29:33 GMT–Neural Information Processing Systems

Adversarial attacks can deceive neural networks by adding tiny perturbations to their input data. Ensemble defenses, which are trained to minimize attack transferability among sub-models, offer a promising research direction to improve robustness against such attacks while maintaining a high accuracy on natural inputs. We discover, however, that recent state-of-the-art (SOTA) adversarial attack strategies cannot reliably evaluate ensemble defenses, sizeably overestimating their robustness. This paper identifies the two factors that contribute to this behavior. First, these defenses form ensembles that are notably difficult for existing gradient-based method to attack, due to gradient obfuscation.

ensemble defense, ensemble robustness evaluation, model reweighing attack, (3 more...)

Neural Information Processing Systems

Jan-18-2025, 12:29:33 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.87)
- Government > Military (0.87)

Technology:
- Information Technology
  - Security & Privacy (0.87)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (0.40)