Contrastive Language-Image Pre-Training with Knowledge Graphs

Neural Information Processing Systems 

Figure 1: CLIP fails to accurately capture some fine-grained semantic information.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found