TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel
–Neural Information Processing Systems
Contrastive Language-Image Pretraining (CLIP) models maximize the mutual information between textual and visual modalities to learn representations.
Neural Information Processing Systems
Oct-9-2025, 23:36:42 GMT
- Country:
- Africa > Central African Republic
- Ombella-M'Poko > Bimbo (0.04)
- Europe > Switzerland
- North America > United States
- Arizona (0.04)
- Maryland
- Baltimore (0.04)
- Baltimore County (0.04)
- Africa > Central African Republic
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.93)
- Research Report
- Industry:
- Media (0.46)
- Technology: