Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck

Open in new window