Muon is Scalable for LLM Training
