Enabling MoE on the Edge via Importance-Driven Expert Scheduling

Open in new window