Effect of Activation Functions on the Training of Overparametrized Neural Nets
