Greedy Attack and Gumbel Attack: Generating Adversarial Examples for Discrete Data

Puyudi Yang, Jianbo Chen, Cho-Jui Hsieh, Jane-Ling Wang, Michael I. Jordan

arXiv.org Machine Learning 

Robustness to adversarial perturbation has become an extremely important criterion for applications of machine learning in security-sensitive domains such as spam detection [25], fraud detection [6], criminal justice [3], malware detection [13], and financial markets [27]. Systematic methods for generating adversarial examples by small perturbations of original input data, also known as "attacks," have been developed to operationalize this criterion and to drive the development of more robust learning systems [4, 26, 7]. Most of the work in this area has focused on differentiable models with continuous input spaces [26, 7, 14]. In this setting, the proposed attack strategies add a gradient-based perturbation to the original input. Such perturbations have been shown to cause a dramatic decrease in the predictive accuracy of the model, demonstrating the vulnerability of deep neural networks to adversarial examples in tasks such as image classification and speech recognition.

We focus instead on adversarial attacks on models with discrete input data, such as text, where each feature of an input sample has a categorical domain. While gradient-based approaches are not directly applicable in this setting, variations of them have been shown to be effective for differentiable models. For example, Li et al. [15] proposed locating the top features whose embeddings have the largest gradient magnitudes, and Papernot et al. [20] proposed modifying randomly selected features of an input by perturbing each feature in the direction of the gradient sign and projecting the result onto the closest vector in the embedding space.
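To make the gradient-based perturbation for continuous inputs concrete, the following is a minimal sketch of the well-known fast gradient sign method; the PyTorch interface (`model`, `loss_fn`) and the step size `epsilon` are illustrative assumptions, not details taken from the paper.

```python
import torch

def fgsm_perturb(model, loss_fn, x, y, epsilon=0.01):
    """Fast gradient sign method (illustrative sketch): move the input a
    small step in the direction that increases the model's loss."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x_adv), y)
    loss.backward()
    # Add a gradient-sign perturbation and detach from the autograd graph.
    return (x_adv + epsilon * x_adv.grad.sign()).detach()
```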
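The discrete-input variants cited above can be sketched in the same style. The snippet below combines the two ideas for illustration: rank token positions by the gradient magnitude of their embeddings (as in Li et al. [15]), then perturb each selected embedding by the gradient sign and project it onto the nearest vocabulary vector (as in Papernot et al. [20], who select positions randomly instead). It assumes, hypothetically, a single token sequence and a model that accepts precomputed embeddings through an `inputs_embeds` argument.

```python
import torch

def gradient_projection_attack(model, loss_fn, token_ids, y,
                               embedding_matrix, k=3, epsilon=1.0):
    """Illustrative sketch of a gradient-plus-projection attack on a
    sequence of discrete tokens (token_ids has shape [seq_len])."""
    # Look up embeddings and track gradients with respect to them.
    emb = embedding_matrix[token_ids].clone().detach().requires_grad_(True)
    loss = loss_fn(model(inputs_embeds=emb), y)  # assumed model interface
    loss.backward()
    # Select the k positions whose embeddings have the largest gradients.
    positions = torch.topk(emb.grad.norm(dim=1), k).indices
    adv_ids = token_ids.clone()
    for pos in positions:
        perturbed = emb[pos] + epsilon * emb.grad[pos].sign()
        # Project onto the closest vector in the embedding space.
        dists = torch.norm(embedding_matrix - perturbed, dim=1)
        adv_ids[pos] = torch.argmin(dists)
    return adv_ids
```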
