A Visual Explanation of Gradient Descent Methods (Momentum, AdaGrad, RMSProp, Adam)
