Do Lessons from Metric Learning Generalize to Image-Caption Retrieval?

Open in new window