Attention Forcing for Machine Translation