FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models
