Efficient Video-to-Audio Generation Network with Rectified Flow Matching Y ongqi Wang

Neural Information Processing Systems 

V2A model based on rectified flow matching.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found