Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization

Open in new window