PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Jun-16-2026, 23:12:07 GMT–Neural Information Processing Systems

A penguin is standing on the lawn, with a giraffe behind it. A young man stands in front of the Statue of Liberty. A man in a tuxedo stands beside Tokyo Tower. A man is standing next to a traditional Japanese lantern. A woman is looking at a small, fluffy dog.

Country:
- Asia
  - China (0.14)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.24)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence (1.00)
