Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction

Open in new window