Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures

Open in new window