Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models