Robust Optimization for Multilingual Translation with Imbalanced Data