Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
