Faster MoE LLM Inference for Extremely Large Models