Discrete optimal transport is a strong audio adversarial attack
Selitskiy, Anton, Shahriyar, Akib, Prakasan, Jishnuraj
–arXiv.org Artificial Intelligence
DISCRETE OPTIMAL TRANSPORT IS A STRONG AUDIO ADVERSARIAL A TT ACK A. Selitskiy, ABSTRACT In this paper, we show that discrete optimal transport (DOT) is an effective black-box adversarial attack against modern audio anti-spoofing countermeasures (CMs). Our attack operates as a post-processing, distribution-alignment step: frame-level WavLM embeddings of generated speech are aligned to an unpaired bona fide pool via entropic OT and a top-k barycentric projection, then decoded with a neural vocoder. Evaluated on ASVspoof2019 and ASVspoof5 with AASIST baselines, DOT yields consistently high equal error rate (EER) across datasets and remains competitive after CM fine-tuning, outperforming several conventional attacks in cross-dataset transfer. Ablation analysis highlights the practical impact of vocoder overlap. Results indicate that distribution-level alignment is a powerful and stable attack surface for deployed CMs.
arXiv.org Artificial Intelligence
Sep-19-2025