Google's AI detects adversarial attacks against image classifiers
Defenses against adversarial attacks, which in the context of AI refer to techniques that fool models through malicious input, are increasingly being broken by "defense-aware" attacks. In fact, most state-of-the-art methods claiming to detect adversarial attacks have been counteracted shortly after their publication. To break the cycle, researchers at the University of California, San Diego and Google Brain, including Turing Award winner Geoffrey Hinton, recently described in a preprint paper an approach that deflects attacks in the computer vision domain. Their framework either detects attacks accurately or, for undetected attacks, pressures the attackers to produce images that resemble the target class of images. The proposed architecture comprises (1) a network that classifies various input images from a data set and (2) a network that reconstructs the inputs conditioned on parameters of a predicted capsule.
Feb-25-2020, 07:00:32 GMT
- Country:
- North America > United States > California > San Diego County > San Diego (0.26)
- Genre:
- Research Report (0.57)
- Industry:
- Information Technology > Security & Privacy (0.88)
- Government > Military (0.88)
- Technology: