Pre-Training LLMs on a budget: A comparison of three optimizers