Maximal Initial Learning Rates in Deep ReLU Networks

Open in new window