Gradient Descent on Logistic Regression with Non-Separable Data and Large Step Sizes
