Stochastic gradient descent with gradient estimator for categorical features