Improving Word Sense Disambiguation in Neural Machine Translation with Salient Document Context