Three Towers: Flexible Contrastive Learning with Pretrained Image Models
–Neural Information Processing Systems
LiT directly replaces the image tower with the frozen embeddings, excluding any potential benefits from training the image tower contrastively.
Neural Information Processing Systems
Oct-8-2025, 19:33:10 GMT
- Country:
- Europe
- Poland (0.04)
- Switzerland > Basel-City
- Basel (0.05)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- North America > Canada
- Europe
- Genre:
- Research Report (0.49)
- Technology: