FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement