Instability in Downstream Task Performance During LLM Pretraining