Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts

Open in new window