Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits
