Acquiring Grounded Representations of Words with Situated Interactive Instruction