On Long-Tailed Phenomena in Neural Machine Translation