Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism

Open in new window