Visual Salience and Reference Resolution in Situated Dialogues: A Corpus-based Evaluation