Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts
