4+3 Phases of Compute-Optimal Neural Scaling Laws
