Algebraic Adversarial Attacks on Explainability Models

Simpson, Lachlan; Costanza, Federico; Millar, Kyle; Cheng, Adriel; Lim, Cheng-Chew; Chew, Hong Gunn

Mar-16-2025–arXiv.org Artificial Intelligence

Classical adversarial attacks are phrased as a constrained optimisation problem. Although this approach is effective, it does not allow one to trace how an adversarial point was generated. In this work, we propose an algebraic approach to adversarial attacks and study the conditions under which adversarial examples can be generated for post-hoc explainability models. Phrasing neural networks in the framework of geometric deep learning, we construct algebraic adversarial attacks by analysing the symmetry groups of neural networks. Algebraic adversarial examples provide a mathematically tractable approach to adversarial examples. We validate our approach on two well-known datasets and one real-world dataset.
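
The idea of a symmetry-based attack can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the two-layer ReLU network, its weights, the choice of gradient-times-input as the post-hoc explanation, and the function names. The sketch uses one simple family of symmetries, translations of the input along the kernel of the first-layer weight matrix. Such a translation leaves the network output unchanged but can change the explanation. The paper's actual construction may differ.

```python
# Toy illustration only: a hypothetical network and attribution method.

def forward(x, W, b, w_out):
    """Two-layer ReLU network f(x) = w_out . relu(W x + b).
    Returns the scalar output and the first-layer pre-activations."""
    pre = [sum(w * xj for w, xj in zip(row, x)) + bi for row, bi in zip(W, b)]
    hidden = [max(0.0, p) for p in pre]
    return sum(wo * h for wo, h in zip(w_out, hidden)), pre

def grad_times_input(x, W, b, w_out):
    """Gradient-times-input attribution, with the gradient derived by hand
    for the ReLU network above."""
    _, pre = forward(x, W, b, w_out)
    grad = [
        sum(w_out[i] * (1.0 if pre[i] > 0 else 0.0) * W[i][j] for i in range(len(W)))
        for j in range(len(x))
    ]
    return [g * xj for g, xj in zip(grad, x)]

# Hypothetical weights, chosen so that the first layer has a non-trivial kernel.
W = [[1.0, 2.0, 0.0],
     [0.0, 1.0, 1.0]]
b = [0.5, -0.5]
w_out = [1.0, -2.0]

# v lies in the kernel of W (W v = 0), so x -> x + t * v is a symmetry of f.
v = [2.0, -1.0, 1.0]
x = [1.0, 1.0, 1.0]
x_adv = [xi + 0.5 * vi for xi, vi in zip(x, v)]

out, _ = forward(x, W, b, w_out)
out_adv, _ = forward(x_adv, W, b, w_out)
attr = grad_times_input(x, W, b, w_out)
attr_adv = grad_times_input(x_adv, W, b, w_out)

print("outputs:", out, out_adv)            # identical predictions
print("attributions:", attr, attr_adv)     # different explanations
```

Because the perturbed point is obtained by applying a known group element to the input, one can state exactly how it was generated, in contrast to a point returned by a numerical optimiser.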

adversarial attack, artificial intelligence, machine learning

Country:
- Oceania > Australia
  - South Australia > Adelaide (0.04)
- North America > United States
  - Wisconsin (0.05)

Genre:
- Research Report (0.50)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government > Military (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.90)
