Training Deeper Neural Machine Translation Models with Transparent Attention

Open in new window