Dealing with Semantic Underspecification in Multimodal NLP