Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
