VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

Open in new window