Analysis of Speech Temporal Dynamics in the Context of Speaker Verification and Voice Anonymization

Tomashenko, Natalia, Vincent, Emmanuel, Tommasi, Marc

Dec-22-2024–arXiv.org Artificial Intelligence

Abstract--In this paper, we investigate the impact of speech temporal dynamics in application to automatic speaker verification and speaker voice anonymization tasks. We propose several metrics to perform automatic speaker verification based only on phoneme durations. Experimental results demonstrate that phoneme durations leak some speaker information and can reveal speaker identity from both original and anonymized speech.

... methods use large-scale pre-trained models for extracting specific attributes and provide better content and privacy preservation than signal processing based methods. The diversity of approaches is illustrated by the VoicePrivacy 2024 Challenge [10], which provided six baseline anonymization systems, namely anonymization using x-vectors and a neural source-filter model [6], [11], signal processing ...

While specific studies have been dedicated to speaker information carried by pitch [5], [6], [8], the impact of speech temporal dynamics on speaker verification and re-identification has been overlooked.

anonymization, artificial intelligence, speech recognition

Country:
- Asia (0.04)
- North America > United States
  - Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Europe > France
  - Hauts-de-France > Nord
    - Lille (0.04)
  - Grand Est > Meurthe-et-Moselle
    - Nancy (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Information Technology > Security & Privacy (0.69)

Technology:
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
