Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference

Open in new window