Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following