A proof of convergence for stochastic gradient descent in the training of artificial neural networks with ReLU activation for constant target functions
