Goto

Collaborating Authors

 detect adversarial attack


Vision Transformer with Adversarial Indicator Token against Adversarial Attacks in Radio Signal Classifications

arXiv.org Artificial Intelligence

--The remarkable success of transformers across various fields such as natural language processing and computer vision has paved the way for their applications in automatic modulation classification, a critical component in the communication systems of Internet of Things (IoT) devices. However, it has been observed that transformer-based classification of radio signals is susceptible to subtle yet sophisticated adversarial attacks. T o address this issue, we have developed a defensive strategy for transformer-based modulation classification systems to counter such adversarial attacks. In this paper, we propose a novel vision transformer (ViT) architecture by introducing a new concept known as adversarial indicator (AdvI) token to detect adversarial attacks. T o the best of our knowledge, this is the first work to propose an AdvI token in ViT to defend against adversarial attacks. Integrating an adversarial training method with a detection mechanism using AdvI token, we combine a training time defense and running time defense in a unified neural network model, which reduces architectural complexity of the system compared to detecting adversarial perturbations using separate models. We investigate into the operational principles of our method by examining the attention mechanism. We show the proposed AdvI token acts as a crucial element within the ViT, influencing attention weights and thereby highlighting regions or features in the input data that are potentially suspicious or anomalous. Through experimental results, we demonstrate that our approach surpasses several competitive methods in handling white-box attack scenarios, including those utilizing the fast gradient method, projected gradient descent attacks and basic iterative method. Lu Zhang is with School of Mathematics and Computer Science, Swansea university, Swansea, SA1 8EN, UK (e-mail: lu.zhang@swansea.ac.uk). Sangarapillai Lambotharan is with Institute for Digital Technologies, Loughborough University London, London, E20 3BS, UK (e-mail: s.lambotharan@lboro.ac.uk). Gan Zheng is with School of Engineering, University of Warwick, Coventry, CV4 7AL, UK (e-mail: gan.zheng@warwick.ac.uk). Guisheng Liao is with School of Electronic Engineering, Xidian University, Xi'an, 710071, People's Republic of China (e-mail: liaogs@xidian.edu.cn). Xuekang Liu is with the Department of Electrical and Electronic Engineering, Faculty of Engineering, Imperial College London, London, SW7 2AZ, U.K. (e-mail: xuekangliu@ieee.org).


Google's AI detects adversarial attacks against image classifiers

#artificialintelligence

Defenses against adversarial attacks, which in the context of AI refer to techniques that fool models through malicious input, are increasingly being broken by "defense-aware" attacks. In fact, most state-of-the-art methods claiming to detect adversarial attacks have been counteracted shortly after their publication. To break the cycle, researchers at the University of California, San Diego and Google Brain, including Turing Award winner Geoffrey Hinton, recently described in a preprint paper an approach that deflects attacks in the computer vision domain. Their framework either detects attacks accurately or, for undetected attacks, pressures the attackers to produce images that resemble the target class of images. The proposed architecture comprises (1) a network that classifies various input images from a data set and (2) a network that reconstructs the inputs conditioned on parameters of a predicted capsule.