Supplementary Materials for MUVR: AMulti-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence Anonymous Author(s) Affiliation Address email

Jun-15-2026, 21:02:28 GMT–Neural Information Processing Systems

In this supplementary material, we elaborate on the MLLMs prompting details in Section 1. We1 further illustrate the annotation instructions in Section 2. Then, some visualization examples are2 provided in Section 3. Limitations and social impact are introduced in Section 4.3 The evaluation prompts for MLLMs are listed in Table 1 and 2. Although we attempted to maintain5 consistency across models, slight variations were necessary due to differing prompting requirements.6 We take the relationship annotation of9 the News partition as an example, while other partitions have different visual correspondences.10 3 Visualization11 Figure 1, 2, 3, 4 and 5 provide several relevant examples of different partitions from MUVR, with a12 text description of the query video and the tag of each video.13 MUVR relies on human annotators to annotate videos with rich semantics.

artificial intelligence, large language model, natural language, (11 more...)

Neural Information Processing Systems

Jun-15-2026, 21:02:28 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found