Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
–Neural Information Processing Systems
The core idea of contrastive learning is to pull the textual and visual representations of matched text-video pairs together and push the representations of unmatched text-video pairs apart.
Neural Information Processing Systems
Aug-18-2025, 17:30:17 GMT
- Country:
- Asia > China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Europe
- Spain (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Asia > China
- Genre:
- Research Report (0.93)
- Technology: