Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks
