Audio Adversarial Examples: Attacks Using Vocal Masks
Tay, Kai Yuan, Ng, Lynnette, Chua, Wei Han, Loke, Lucerne, Ye, Danqi, Chua, Melissa
arXiv.org Artificial Intelligence
We construct audio adversarial examples against automatic Speech-To-Text (STT) systems. Given any audio waveform, we produce another by overlaying an audio vocal mask generated from the original audio. We apply our audio adversarial attack to five state-of-the-art STT systems: DeepSpeech, Julius, Kaldi, wav2letter@anywhere and CMUSphinx. In addition, we engaged human annotators to transcribe the adversarial audio. Our experiments show that these adversarial examples fool state-of-the-art STT systems, yet humans can consistently pick out the speech. The feasibility of this attack opens a new domain for studying machine and human perception of speech.
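The abstract does not publish the mask-construction procedure, but the core idea (derive a mask from the original audio's own spectral content and overlay it on the waveform) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the function name `vocal_mask_overlay`, the frame size, the quantile threshold, and the overlay weight `alpha` are hypothetical choices, not the authors' method.

```python
import numpy as np

def vocal_mask_overlay(waveform, frame=256, alpha=0.05):
    """Hypothetical sketch of a mask-overlay perturbation.

    A crude "vocal mask" is built by keeping only the dominant
    spectral bins of each short-time frame of the original audio;
    the resynthesized mask is then overlaid on the waveform at a
    small weight. This is an illustrative assumption, not the
    paper's actual attack.
    """
    n = len(waveform) // frame * frame
    x = waveform[:n].reshape(-1, frame)
    spec = np.fft.rfft(x, axis=1)
    mag = np.abs(spec)
    # Keep only the top 10% of bins per frame (the "mask").
    thresh = np.quantile(mag, 0.9, axis=1)[:, None]
    masked = np.where(mag >= thresh, spec, 0)
    mask_audio = np.fft.irfft(masked, n=frame, axis=1).reshape(-1)
    out = waveform.astype(float).copy()
    out[:n] += alpha * mask_audio  # overlay the mask as a perturbation
    return out

# Usage: perturb a synthetic 440 Hz tone.
t = np.linspace(0, 1, 8192, endpoint=False)
clean = np.sin(2 * np.pi * 440 * t)
adv = vocal_mask_overlay(clean)
```

Because the mask is derived from the signal itself, the perturbation is correlated with the speech content rather than being random noise, which is consistent with the abstract's claim that humans can still pick out the speech.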
Feb-5-2021