Deriving Neural Scaling Laws from the statistics of natural language