MEWL: Few-shot multimodal word learning with referential uncertainty