Microsoft's DeepSpeed-MoE Makes Massive MoE Model Inference up to 4.5x Faster and 9x Cheaper