Compact Rule-Based Classifier Learning via Gradient Descent