Teaching Computers to Describe Images as People Would