AutoScale: Automatic Prediction of Compute-optimal Data Composition for Training LLMs