Early Weight Averaging meets High Learning Rates for LLM Pre-training