Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Open in new window