Guiding Exploratory Behaviors for Multi-Modal Grounding of Linguistic Descriptions