A Benchmark for Long-context Interleaved Video-Language Understanding Haoning Wu Dongxu Li Bei Chen Junnan Li Rhymes AI

Neural Information Processing Systems 

Specifically, as part of the question, it contains a referring query that references related video contexts, called referred context .