Gradient Descent, the Learning Rate, and the importance of Feature Scaling

Open in new window