SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer

Open in new window