Information-Theoretic Foundations for Neural Scaling Laws
