Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters
