LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks

Costa, Joana C., Roxo, Tiago, Proença, Hugo, Inácio, Pedro R. M.

Feb-27-2025–arXiv.org Artificial Intelligence

--State-of-the-art defense mechanisms are typically evaluated in the context of white-box attacks, which is not realistic, as it assumes the attacker can access the gradients of the target network. T o protect against this scenario, Adversarial Training (A T) and Adversarial Distillation (AD) include adversarial examples during the training phase, and Adversarial Purification uses a generative model to reconstruct all the images given to the classifier . This paper considers an even more realistic evaluation scenario: gray-box attacks, which assume that the attacker knows the architecture and the dataset used to train the target network, but cannot access its gradients. We provide empirical evidence that models are vulnerable to gray-box attacks and propose LISArD, a defense mechanism that does not increase computational and temporal costs but provides robustness against gray-box and white-box attacks without including A T . Our method approximates a cross-correlation matrix, created with the embeddings of perturbed and clean images, to a diagonal matrix while simultaneously conducting classification learning. Our results show that LISArD can effectively protect against gray-box attacks, can be used in multiple architectures, and carries over its resilience to the white-box scenario. Also, state-of-the-art AD models underperform greatly when removing A T and/or moving to gray-box settings, highlighting the lack of robustness from existing approaches to perform in various conditions (aside from white-box settings). EEP Neural Networks (DNNs) have achieved remarkable performance in multiple areas, such as Medical Imaging [1], [2], Natural Language Processing [3], [4], and Active Speaker Detection [5]-[7]. This accomplishment led to the wide adoption of Artificial Intelligence in the daily lives of many people, either in work or leisure scenarios, increasing the attractiveness and susceptibility of DNNs to attackers. The study of DNN security is still in its early stages.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Feb-27-2025

arXiv.org PDF

Add feedback

Country:
- Europe
  - Portugal (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- Africa > Mozambique
  - Sofala Province > Beira (0.05)

Genre:
- Research Report > New Finding (0.86)

Industry:
- Information Technology > Security & Privacy (1.00)
- Education (0.93)
- Health & Medicine > Diagnostic Medicine
  - Imaging (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found