Efficient Video-to-Audio Generation Network with Rectified Flow Matching
