A Minimalist Optimizer Design for LLM Pretraining
