SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model

Open in new window