Scaling Laws for Downstream Task Performance of Large Language Models
