Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck

Benjamin L. Edelman

Neural Information Processing Systems 

This work investigates how resource tradeoffs among data, compute, width, and luck necessarily arise for feature learning in the presence of computational-statistical gaps.