Who are you referring to? Coreference resolution in image narrations