Scaling Laws for Downstream Task Performance of Large Language Models