HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition

Open in new window