$μ$-MoE: Test-Time Pruning as Micro-Grained Mixture-of-Experts

Open in new window