ml-logistic regression
ML-Logistic Regression
There are other optimization algorithms than gradient descent. These algorithms automatically pick the appropriate learning rate alpha, and are usually faster. One way to do this is to do a "One vs all" binary classification. To do this we do a binary classification with a certain class and all the other classes, and select the largest one that has the largest hypothesis output. Since we have 3 classes here, we do the binary classification 3 times.