Voice Cloning: Comprehensive Survey

Azzuni, Hussam, Saddik, Abdulmotaleb El

arXiv.org Artificial Intelligence 

Voice cloning has advanced rapidly in today's digital world, with many researchers and corporations working to improve these algorithms for various applications. This article aims to establish a standardized terminology for voice cloning and explore its different variations. It covers speaker adaptation as the fundamental concept and then delves deeper into topics such as few-shot, zero-shot, and multilingual Text-to-Speech (TTS) within that context. Finally, we explore the evaluation metrics commonly used in voice cloning research and related datasets. This survey compiles the available voice cloning algorithms to encourage research on both the generation and the detection of cloned voices, with the aim of limiting misuse.

Voice cloning is the ability to replicate a person's voice. Advancing these algorithms relies on enhancing the performance of TTS systems in various areas, including speech quality, naturalness, prosody, and timbre, ensuring the produced voice closely resembles the target speaker.
