Discrete Optimal Transport and Voice Conversion
Selitskiy, Anton, Kocharekar, Maitreya
–arXiv.org Artificial Intelligence
In this work, we address the voice conversion (VC) task using a vector-based interface. To align audio embeddings between speakers, we employ discrete optimal transport mapping. Our evaluation results demonstrate the high quality and effectiveness of this method. Additionally, we show that applying discrete optimal transport as a post-processing step in audio generation can lead to the incorrect classification of synthetic audio as real.
arXiv.org Artificial Intelligence
Dec-2-2025
- Country:
- Europe > United Kingdom
- England > Greater London > London (0.04)
- North America > United States
- New York > Monroe County > Rochester (0.04)
- Europe > United Kingdom
- Genre:
- Research Report > New Finding (0.48)
- Technology: