Supplemental: A Benchmark for Compositional Text-to-image Retrieval

Open in new window