Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos

Open in new window