Learning Interpretable Rules for Scalable Data Representation and Classification

Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

Jan-29-2024, arXiv.org Artificial Intelligence

Rule-based models, e.g., decision trees, are widely used in scenarios demanding high model interpretability because of their transparent inner structures and good model expressivity. However, rule-based models are hard to optimize, especially on large data sets, due to their discrete parameters and structures. Ensemble methods and fuzzy/soft rules are commonly used to improve performance, but they sacrifice model interpretability. To obtain both good scalability and interpretability, we propose a new classifier, named Rule-based Representation Learner (RRL), that automatically learns interpretable non-fuzzy rules for data representation and classification. To train the non-differentiable RRL effectively, we project it to a continuous space and propose a novel training method, called Gradient Grafting, that can directly optimize the discrete model using gradient descent. We also devise a novel design of logical activation functions to increase the scalability of RRL and enable it to discretize continuous features end-to-end. Exhaustive experiments on ten small and four large data sets show that RRL outperforms competitive interpretable approaches and can be easily adjusted to trade off classification accuracy against model complexity for different scenarios. Our code is available at: https://github.com/12wang3/rrl.
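
The idea of training a discrete rule through a continuous projection can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the RRL code base: the function names (`conj_soft`, `binarize`, `grafted_step`) are hypothetical, the product-based soft conjunction is a common textbook relaxation rather than the paper's novel logical activation functions, and the update rule is only one plausible reading of "Gradient Grafting" (loss gradient evaluated at the discrete model's output, combined with the derivative of the continuous model's output with respect to the weights).

```python
from math import prod

def conj_soft(x, w):
    """Soft conjunction: prod_i (1 - w_i * (1 - x_i)).
    With binary w and x this is exactly AND over the selected inputs."""
    return prod(1.0 - wi * (1.0 - xi) for xi, wi in zip(x, w))

def binarize(w, threshold=0.5):
    """Discrete (non-fuzzy) rule: keep an input only if its weight exceeds the threshold."""
    return [1.0 if wi > threshold else 0.0 for wi in w]

def grafted_step(x, y, w, lr=0.1):
    """One illustrative update (an assumed form, not the authors' implementation).

    The loss gradient is taken at the DISCRETE model's output, while the
    path back to the weights goes through the CONTINUOUS model.
    """
    y_disc = conj_soft(x, binarize(w))      # output of the discrete rule
    dloss_dy = 2.0 * (y_disc - y)           # d/dy of the squared error (y_disc - y)^2
    new_w = []
    for j, wj in enumerate(w):
        # d conj_soft / d w_j = -(1 - x_j) * prod_{i != j} (1 - w_i * (1 - x_i))
        others = prod(1.0 - wi * (1.0 - xi)
                      for i, (xi, wi) in enumerate(zip(x, w)) if i != j)
        dy_dw = -(1.0 - x[j]) * others
        updated = wj - lr * dloss_dy * dy_dw
        new_w.append(min(1.0, max(0.0, updated)))  # keep weights in [0, 1]
    return new_w

# Toy example: the rule should fire (y = 1) on x, but the discrete rule
# "x0 AND x1" does not, because x1 = 0.
x = [1.0, 0.0, 1.0]
y = 1.0
w = [0.9, 0.6, 0.2]
w_next = grafted_step(x, y, w)
```

In this toy case a single step pushes the weight on the violated input `x1` below the threshold, so the binarized rule reduces to `x0` and fires on `x`. Because the rule used at prediction time is always the binarized one, it stays readable as a plain logical rule throughout training.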

activation function, logical layer, rrl

Country:
- North America > United States
  - California > San Francisco County > San Francisco (0.04)
- Asia > China
  - Beijing > Beijing (0.04)
  - Shanghai > Shanghai (0.04)
  - Zhejiang Province > Hangzhou (0.04)
  - Shandong Province > Jinan (0.04)
  - Jiangsu Province > Xuzhou (0.04)

Genre:
- Research Report > Experimental Study (0.67)

Industry:
- Education (0.67)
- Information Technology > Security & Privacy (0.67)
- Health & Medicine > Therapeutic Area
  - Neurology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Uncertainty (1.00)
    - Rule-Based Reasoning (1.00)
  - Machine Learning
    - Statistical Learning (1.00)
    - Decision Tree Learning (1.00)
    - Neural Networks > Deep Learning (0.93)