Rebalancing Contrastive Alignment with Bottlenecked Semantic Increments in Text-Video Retrieval

Open in new window