Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation

Open in new window