Learning Multi-Modal Word Representation Grounded in Visual Context
