When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
–Neural Information Processing Systems
Existing Moment retrieval (MR) methods focus on Single-Moment Retrieval (SMR). However, one query can correspond to multiple relevant moments in real-world applications. This makes the existing datasets and methods insufficient for video temporal grounding. By revisiting the gap between current MR tasks and real-world applications, we introduce a high-quality datasets called QVHighlights Multi-Moment Dataset (QV-M2), along with new evaluation metrics tailored for multi-moment retrieval (MMR). QV-M2 consists of 2,212 annotations covering 6,384 video segments.
Neural Information Processing Systems
Jun-23-2026, 03:52:38 GMT
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.68)
- Research Report
- Industry:
- Information Technology (0.46)
- Technology:
- Information Technology
- Communications (0.68)
- Artificial Intelligence
- Natural Language (1.00)
- Machine Learning > Neural Networks (0.93)
- Vision (0.68)
- Information Technology