Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations

Neural Information Processing Systems 

We present a neural analysis and synthesis (NANSY) framework that can manipulate voice, pitch, and speed of an arbitrary speech signal. Most of the previous works have focused on using information bottleneck to disentangle analysis features for controllable synthesis, which usually results in poor reconstruction quality.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found