Chain-of-Talkers (CoTalk): Fast Human Annotation of Dense Image Captions