Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging

Open in new window