Detecting Concrete Visual Tokens for Multimodal Machine Translation

Open in new window