Beyond Coarse-Grained Matching in Video-Text Retrieval

Open in new window