Reviews: FastSpeech: Fast, Robust and Controllable Text to Speech
–Neural Information Processing Systems
The paper proposes a novel non-autoregressive, parallel approach to TTS that uses mel-spectrograms as the intermediate representation. The reviewers concur that the paper adds two novel explicit components to TTS systems, the length and duration modules, and that the results on inference speedup and high-quality audio generation are relevant.
Jun-1-2025, 23:53:02 GMT