LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Open in new window