Unbiased Measurement of Feature Importance in Tree-Based Methods
This paper examines split-improvement feature importance scores for tree-based methods. Starting with Classification and Regression Trees (CART; Breiman, 2017) and C4.5 (Quinlan, 2014), decision trees have been a workhorse of general machine learning, particularly within ensemble methods such as Random Forests (RF; Breiman, 2001) and Gradient Boosting Trees (Friedman, 2001). They enjoy the benefits of computational speed, few tuning parameters and natural ways of handling missing values.
Mar-12-2019
- Country:
- North America > United States > New York > New York County > New York City (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
- Banking & Finance (0.46)
- Technology: