Towards Open-Vocabulary Video Semantic Segmentation

Open in new window