Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study
