4+3 Phases of Compute-Optimal Neural Scaling Laws Elliot Paquette

Neural Information Processing Systems 

We consider the solvable neural scaling model with three parameters: data complexity, target complexity, and model-parameter-count.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found