Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation