The Two Phases of Gradient Descent in Deep Learning
