Using Multimodal Deep Neural Networks to Disentangle Language from Visual Aesthetics

Open in new window