Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition