LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference
