An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation