Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

Open in new window