When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions

Jun-23-2026, 03:52:38 GMT–Neural Information Processing Systems

Existing moment retrieval (MR) methods focus on Single-Moment Retrieval (SMR). However, in real-world applications a single query can correspond to multiple relevant moments, which makes existing datasets and methods insufficient for video temporal grounding. By revisiting the gap between current MR tasks and real-world applications, we introduce a high-quality dataset called QVHighlights Multi-Moment Dataset (QV-M2), along with new evaluation metrics tailored for multi-moment retrieval (MMR). QV-M2 consists of 2,212 annotations covering 6,384 video segments.

artificial intelligence, machine learning, natural language

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.68)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology
  - Communications (0.68)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Neural Networks (0.93)
    - Vision (0.68)
