Revisiting the Markov Property for Machine Translation