A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

Open in new window