Synthetic Speech Source Tracing using Metric Learning
Koutsianos, Dimitrios, Zacharopoulos, Stavros, Panagakis, Yannis, Stafylakis, Themos
–arXiv.org Artificial Intelligence
This paper addresses source tracing in synthetic speech--identifying generative systems behind manipulated audio via speaker recognition-inspired pipelines. While prior work focuses on spoofing detection, source tracing lacks robust solutions. We evaluate two approaches: classification-based and metric-learning. We tested our methods on the MLAADv5 benchmark using ResNet and self-supervised learning (SSL) backbones. The results show that ResNet achieves competitive performance with the metric learning approach, matching and even exceeding SSL-based systems. Our work demonstrates ResNet's viability for source tracing while underscoring the need to optimize SSL representations for this task.
arXiv.org Artificial Intelligence
Jun-4-2025
- Country:
- North America > United States (0.04)
- Europe > Greece
- Genre:
- Research Report > New Finding (0.48)
- Technology: