EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
