How stealthy is stealthy? Studying the Efficacy of Black-Box Adversarial Attacks in the Real World

Panebianco, Francesco, D'Onghia, Mario, Carminati, Stefano Zanero aand Michele

arXiv.org Artificial Intelligence 

Deep learning systems, critical in domains like autonomous vehicles, are vulnerable to adversarial examples (crafted inputs designed to mislead classifiers). This study investigates black-box adversarial attacks in computer vision. This is a realistic scenario, where attackers have query-only access to the target model. Three properties are introduced to evaluate attack feasibility: robustness to compression, stealthiness to automatic detection, and stealthiness to human inspection. State-of-the-Art methods tend to prioritize one criterion at the expense of others. We propose ECLIPSE, a novel attack method employing Gaussian blurring on sampled gradients and a local surrogate model.