Methods of improving LLM training stability