SpecWav-Attack: Leveraging Spectrogram Resizing and Wav2Vec 2.0 for Attacking Anonymized Speech

Li, Yuqi; Zheng, Yuanzhong; Guo, Zhongtian; Wang, Yaoxuan; Yin, Jianjun; Fei, Haojun

May-16-2025–arXiv.org Artificial Intelligence

This paper presents SpecWav-Attack, an adversarial model for detecting speakers in anonymized speech. It uses Wav2Vec2 for feature extraction [1] and incorporates spectrogram resizing and incremental training to improve performance. Evaluated on librispeech-dev and librispeech-test and benchmarked against the ICASSP 2025 Attacker Challenge [2], SpecWav-Attack outperforms conventional attacks, revealing vulnerabilities in anonymized speech systems and underscoring the need for stronger defenses.

This paper introduces SpecWav-Attack, a tailored adversarial model for attacking anonymized speech with a focus on Effective Equal Error Rate (EER). Building on the ECAPA-TDNN architecture [3], we integrate the Wav2Vec2 self-supervised model [1] to enrich speech representations, enhancing sensitivity to variations in anonymized data.
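Since the attack is assessed by EER, it may help to recall how that metric is typically computed from verification trial scores. The following is a minimal sketch, not the authors' evaluation code: the function name `equal_error_rate` and the toy scores are hypothetical, and it assumes that higher scores mean "same speaker". In an attacker setting, a lower EER generally indicates that speakers are easier to re-identify, which corresponds to a stronger attack.

```python
import numpy as np


def equal_error_rate(target_scores, nontarget_scores):
    """Return (eer, threshold) for same-speaker vs. different-speaker trials.

    Hypothetical helper for illustration; assumes higher score = same speaker.
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)

    # Use every observed score as a candidate decision threshold.
    thresholds = np.sort(np.concatenate([target, nontarget]))

    # False rejection rate: same-speaker trials scored below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: different-speaker trials scored at or above it.
    far = np.array([(nontarget >= t).mean() for t in thresholds])

    # The EER is taken where the two error rates are closest to each other.
    idx = int(np.argmin(np.abs(far - frr)))
    return (far[idx] + frr[idx]) / 2.0, thresholds[idx]


# Toy scores (made up for illustration only).
eer, threshold = equal_error_rate([0.9, 0.8, 0.7, 0.4], [0.6, 0.3, 0.2, 0.1])
print(f"EER = {eer:.2f} at threshold {threshold:.2f}")
```

With finite trial lists the two error curves rarely cross exactly, so this sketch averages them at the closest point; evaluation toolkits may interpolate instead and report slightly different values.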
