Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
–Neural Information Processing Systems
During optimization, contrastive learning keeps the different modalities separated by a certain distance, which is influenced by the temperature parameter in the loss function. Our experiments further demonstrate that varying the modality gap distance has a significant impact in improving the model's downstream zero-shot classification performance and fairness.
Neural Information Processing Systems
Feb-9-2026, 17:35:54 GMT
- Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.05)
- Genre:
- Research Report > Experimental Study (0.46)
- Industry:
- Education (0.46)
- Technology: