Augmentation through Laundering Attacks for Audio Spoof Detection
Ali, Hashim, Subramani, Surya, Malik, Hafiz
–arXiv.org Artificial Intelligence
Recent text-to-speech (TTS) developments have made voice cloning (VC) more realistic, affordable, and easily accessible. This has given rise to many potential abuses of this technology, including Joe Biden's New Hampshire deepfake robocall. Several methodologies have been proposed to detect such clones. However, these methodologies have been trained and evaluated on relatively clean databases. Recently, ASVspoof 5 Challenge introduced a new crowd-sourced database of diverse acoustic conditions including various spoofing attacks and codec conditions. This paper is our submission to the ASVspoof 5 Challenge and aims to investigate the performance of Audio Spoof Detection, trained using data augmentation through laundering attacks, on the ASVSpoof 5 database. The results demonstrate that our system performs worst on A18, A19, A20, A26, and A30 spoofing attacks and in the codec and compression conditions of C08, C09, and C10.
arXiv.org Artificial Intelligence
Oct-1-2024
- Country:
- North America > United States
- New Hampshire (0.24)
- Michigan > Wayne County
- Dearborn (0.04)
- Europe
- Ukraine (0.14)
- United Kingdom > England
- Greater London > London (0.04)
- Asia
- Pakistan (0.04)
- Middle East > Palestine (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Technology:
- Information Technology
- Security & Privacy (1.00)
- Artificial Intelligence
- Vision (1.00)
- Speech (1.00)
- Machine Learning > Neural Networks (1.00)
- Information Technology