Revising Image-Text Retrieval via Multi-Modal Entailment

Open in new window