Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching

Open in new window