VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval

Open in new window