Scalable and Efficient MoE Training for Multitask Multilingual Models