Unbiased Measurement of Feature Importance in Tree-Based Methods

Mar-12-2019–arXiv.org Machine Learning

This paper examines split-improvement feature importance scores for tree-based methods. Starting with Classification and Regression Trees (CART; Breiman, 2017) and C4.5 (Quinlan, 2014), decision trees have been a workhorse of general machine learning, particularly within ensemble methods such as Random Forests (RF; Breiman, 2001) and Gradient Boosting Trees (Friedman, 2001). They enjoy the benefits of computational speed, few tuning parameters and natural ways of handling missing values.

categorical feature, feature importance, random forest, (14 more...)

arXiv.org Machine Learning

Mar-12-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States > New York > New York County > New York City (0.04)

Genre:
- Research Report (0.84)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
- Banking & Finance (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Ensemble Learning (1.00)
  - Decision Tree Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found