Observational Scaling Laws and the Predictability of Language Model Performance Chris J. Maddison 2,3

Open in new window