VLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval

Neural Information Processing Systems 

Compositionality is still a challenging problem.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found