Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
